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Preface to the Third Edition 


The second edition of Statistical Mechanics was published in 1996. The new material added at 
that time focused on phase transitions, critical phenomena, and the renormalization group — 
topics that had undergone vast transformations during the years following the publication of 
the first edition in 1972. In 2009, R. K. Pathria (R.K.E) and the publishers agreed it was time for 
a third edition to incorporate the important changes that had occurred in the field since the 
publication of the second edition and invited Paul B. Beale (P.D.B.) to join as coauthor. The two 
authors agreed on the scope of the additions and changes and P.D.B. wrote the first draft of 
the new sections except for Appendix F which was written by R.K.P. Both authors worked very 
closely together editing the drafts and finalizing this third edition. 

The new topics added to this edition are: 

• Bose-Einstein condensation and degenerate Fermi gas behavior in ultracold atomic gases: 
Sections 7.2, 8.4,11.2.A, and 11.9. The creation of Bose-Einstein condensates in ultracold 
gases during the 1990s and in degenerate Fermi gases during the 2000s led to a revolution 
in atomic, molecular, and optical physics, and provided a valuable link to the quantum 
behavior of condensed matter systems. Several of P.D.B.’s friends and colleagues in physics 
and JILA at the University of Colorado have been leaders in this exciting new field. 

• Finite-size scaling behavior of Bose-Einstein condensates: Appendix F. We develop an 
analytical theory for the behavior of Bose-Einstein condensates in a finite system, which 
provides a rigorous justification for singling out the ground state in the calculation of the 
properties of the Bose-Einstein condensate. 

• Thermodynamics of the early universe: Chapter 9. The sequence of thermodynamic 
transitions that the universe went though shortly after the Big Bang left behind mileposts 
that astrophysicists have exploited to look back into the universe’s earliest moments. Major 
advances in astronomy over the past 20 years have provided a vast body of observational 
data about the early evolution of the universe. These include the Hubble Space Telescope’s 
deep space measurements of the expansion of the universe, the Cosmic Background 
Explorer’s precise measurements of the temperature of the cosmic microwave background, 
and the Wilkinson Microwave Anisotropy Probe’s mapping of the angular variations in the 
cosmic microwave background. These data sets have led to precise determinations of the 
age of the universe, its composition and early evolution. Coincidentally, P.D.B.’s faculty 
office is located in the tower named after George Gamow, a member of the faculty at the 
University of Colorado in the 1950s and 1960s and a leader in the theory of nucleosynthesis 
in the early universe. 

• Chemical equilibrium: Section 6.6. Chemical potentials determine the conditions 
necessary for chemical equilibrium. This is an important topic in its own right, but also 
plays a critical role in our discussion of the thermodynamics of the early universe in 
Chapter 9. 
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• Monte Carlo and molecular dynamics simulations: Chapter 16. Computer simulations have 
become an important tool in modern statistical mechanics. We provide here a brief 
introduction to Monte Carlo and molecular dynamics techniques and algorithms. 

• Correlation functions and scattering: Section 10.7. Correlation functions are central to the 
understanding of thermodynamic phases, phase transitions, and critical phenomena. The 
differences between thermodynamic phases are often most conspicuous in the behavior 
of correlation functions and the closely related static structure factors. We have collected 
discussions from the second edition into one place and added new material. 

• Fluctuation-dissipation theorem and the dynamical structure factor: Sections 15.3.A., 

15.6.A, and 15.6.B. The fluctuation-dissipation theorem describes the relation between 
natural equilibrium thermodynamic fluctuations in a system and the response of the 
system to small disturbances from equilibrium, and it is one of the cornerstones of 
nonequilibrium statistical mechanics. We have expanded the discussion of the 
fluctuation-dissipation theorem to include a derivation of the key results from linear 
response theory, a discussion of the dynamical structure factor, and analysis of the 
Brownian motion of harmonic oscillators that provides useful practical examples. 

• Phase equilibrium and the Clausius-Clapeyron equation: Sections 4.6 and 4.7. Much of the 
text is devoted to using statistical mechanics methods to determine the properties of 
thermodynamic phases and phase transitions. This brief overview of phase equilibrium 
and the structure of phase diagrams lays the groundwork for later discussions. 

• Exact solutions of one-dimensional fluid models: Section 13.1. One-dimensional fluid 
models with short-range interactions do not exhibit phase transitions but they do display 
short-range correlations and other behaviors typical of dense fluids. 

• Exact solution of the two-dimensional Ising model on a finite lattice: Section 13.4.A. This 
solution entails an exact counting of the microstates of the microcanonical ensemble and 
provides analytical results for the energy distribution, internal energy, and heat capacity of 
the system. This solution also describes the finite-size scaling behavior of the Ising model 
near the transition point and provides an exact framework that can be 

used to test Monte Carlo methods. 

• Summary of thermodynamic assemblies and associated statistical ensembles: Appendix H. 
We provide a summary of thermodynamic relations and their connections to statistical 
mechanical ensembles. Most of this information can be found elsewhere in the text, but we 
thought it would be helpful to provide a rundown of these important connections in one 
place. 

• Pseudorandom number generators: Appendix I. Pseudorandom number generators are 
indispensable in computer simulations. We provide simple algorithms for generating 
uniform and Gaussian pseudorandom numbers and discuss their properties. 

• Dozens of new homework problems. 

The remainder of the text is largely unchanged. 

The completion of this task has left us indebted to many a friend and colleague. R.K.P. has 
already expressed his indebtedness to a good number of people on two previous occasions — 
in 1972 and in 1996 — so, at this time, he will simply reiterate the many words of gratitude he 
has already written. In addition though, he would like to thank Paul Beale for his willingness to 
be a partner in this project and for his diligence in carrying out the task at hand both arduously 
and meticulously. 

On his part, P.D.B. would like to thank his friends at the University of Colorado at Boulder 
for the many conversations he has had with them over the years about research and pedagogy 
of statistical mechanics, especially Noel Clark, Tom DeGrand, John Price, Chuck Rogers, Mike 
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Dubson, and Leo Radzihovsky. He would also like to thank the faculty of the Department of 
Physics for according him the honor of serving as the chair of this outstanding department. 

Special thanks are also due to many friends and colleagues who have read sections of 
the manuscript and have offered many valuable suggestions and corrections, especially Tom 
DeGrand, Michael Shull, David Nesbitt, Jamie Nagle, Matt Glaser, Murray Holland, Leo Radzi¬ 
hovsky, Victor Gurarie, Edmond Meyer, Matthew Grau, Andrew Sisler, Michael Foss-Feig, Allan 
Franklin, Shantha deAlwis, Dmitri Reznik, and Eric Cornell. 

P.D.B. would like to take this opportunity to extend his thanks and best wishes to Professor 
Michael E. Fisher whose graduate statistical mechanics course at Cornell introduced him to this 
elegant field. He would also like to express his gratitude to Raj Pathria for inviting him to be part 
of this project, and for the fun and engaging discussions they have had during the preparation 
of this new edition. Raj’s thoughtful counsel always proved to be valuable in improving the text. 

P.D.B.’s greatest thanks go to Matthew, Melanie, and Erika for their love and support. 

R.K.P. 

P.D.B. 



Preface to the Second Edition 


The first edition of this book was prepared over the years 1966 to 1970 when the subject of phase 
transitions was undergoing a complete overhaul. The concepts of scaling and universality had 
just taken root but the renormalization group approach, which converted these concepts into 
a calculational tool, was still obscure. Not surprisingly, my text of that time could not do justice 
to these emerging developments. Over the intervening years I have felt increasingly conscious 
of this rather serious deficiency in the text; so when the time came to prepare a new edition, my 
major effort went toward correcting that deficiency. 

Despite the aforementioned shortcoming, the first edition of this book has continued to 
be popular over the last 20 years or so. I, therefore, decided not to tinker with it unnecessar¬ 
ily. Nevertheless, to make room for the new material, I had to remove some sections from the 
present text which, I felt, were not being used by the readers as much as the rest of the book was. 
This may turn out to be a disappointment to some individuals but I trust they will understand 
the logic behind it and, if need be, will go back to a copy of the first edition for reference. I, on 
my part, hope that a good majority of the users will not be inconvenienced by these deletions. 
As for the material retained, I have confined myself to making only editorial changes. The sub¬ 
ject of phase transitions and critical phenomena, which has been my main focus of revision, 
has been treated in three new chapters that provide a respectable coverage of the subject and 
essentially bring the book up to date. These chapters, along with a collection of more than 60 
homework problems, will, I believe, enhance the usefulness of the book for both students and 
instructors. 

The completion of this task has left me indebted to many. First of all, as mentioned in 
the Preface to the first edition, I owe a considerable debt to those who have written on this 
subject before and from whose writings I have benefited greatly. It is difficult to thank them 
all individually; the bibliography at the end of the book is an obvious tribute to them. As for 
definitive help, I am most grateful to Dr Surjit Singh who advised me expertly and assisted me 
generously in the selection of the material that comprises Chapters 11 to 13 of the new text; 
without his help, the final product might not have been as coherent as it now appears to be. On 
the technical side, I am very thankful to Mrs. Debbie Guenther who typed the manuscript with 
exceptional skill and proof read it with extreme care; her task was clearly an arduous one but 
she performed it with good cheer — for which I admire her greatly. 

Finally, I wish to express my heartfelt appreciation for my wife who let me devote myself 
fully to this task over a rather long period of time and waited for its completion ungrudgingly. 

R.K.P. 
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Preface to the First Edition 


This book has arisen out of the notes of lectures that I gave to the graduate students at 
the McMaster University (1964-1965), the University of Alberta (1965-1967), the University of 
Waterloo (1969-1971), and the University of Windsor (1970-1971). While the subject matter, in 
its finer details, has changed considerably during the preparation of the manuscript, the style 
of presentation remains the same as followed in these lectures. 

Statistical mechanics is an indispensable tool for studying physical properties of matter 
“in bulk” on the basis of the dynamical behavior of its “microscopic” constituents. Founded 
on the well-laid principles of mathematical statistics on one hand and Hamiltonian mechanics 
on the other, the formalism of statistical mechanics has proved to be of immense value to the 
physics of the last 100 years. In view of the universality of its appeal, a basic knowledge of this 
subject is considered essential for every student of physics, irrespective of the area(s) in which 
he/she may be planning to specialize. To provide this knowledge, in a manner that brings out 
the essence of the subject with due rigor but without undue pain, is the main purpose of this 
work. 

The fact that the dynamics of a physical system is represented by a set of quantum states 
and the assertion that the thermodynamics of the system is determined by the multiplicity of 
these states constitute the basis of our treatment. The fundamental connection between the 
microscopic and the macroscopic descriptions of a system is uncovered by investigating the 
conditions for equilibrium between two physical systems in thermodynamic contact. This is 
best accomplished by working in the spirit of the quantum theory right from the beginning; 
the entropy and other thermodynamic variables of the system then follow in a most natural 
manner. After the formalism is developed, one may (if the situation permits) go over to the 
limit of the classical statistics. This message may not be new, but here I have tried to follow it as 
far as is reasonably possible in a textbook. In doing so, an attempt has been made to keep the 
level of presentation fairly uniform so that the reader does not encounter fluctuations of too 
wild a character. 

This text is confined to the study of the equilibrium states of physical systems and is 
intended to be used for a graduate course in statistical mechanics. Within these bounds, the 
coverage is fairly wide and provides enough material for tailoring a good two-semester course. 
The final choice always rests with the individual instructor; I, for one, regard Chapters 1 to 9 
[minus a few sections from these chapters plus a few sections from Chapter 13) as the “essential 
part” of such a course. The contents of Chapters 10 to 12 are relatively advanced (not necessar¬ 
ily difficult); the choice of material out of these chapters will depend entirely on the taste of 
the instructor. To facilitate the understanding of the subject, the text has been illustrated with 
a large number of graphs; to assess the understanding, a large number of problems have been 
included. I hope these features are found useful. 


xtx 
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I feel that one of the most essential aspects of teaching is to arouse the curiosity of the 
students in their subject, and one of the most effective ways of doing this is to discuss with them 
(in a reasonable measure, of course) the circumstances that led to the emergence of the subject. 
One would, therefore, like to stop occasionally to reflect upon the manner in which the various 
developments really came about; at the same time, one may not like the flow of the text to be 
hampered by the discontinuities arising from an intermittent addition of historical material. 
Accordingly, I decided to include in this account a historical introduction to the subject which 
stands separate from the main text. I trust the readers, especially the instructors, will find it of 
interest. 

For those who wish to continue their study of statistical mechanics beyond the confines 
of this book, a fairly extensive bibliography is included. It contains a variety of references — old 
as well as new, experimental as well as theoretical, technical as well as pedagogical. I hope that 
this will make the book useful for a wider readership. 

The completion of this task has left me indebted to many. Like most authors, I owe con¬ 
siderable debt to those who have written on the subject before. The bibliography at the end of 
the book is the most obvious tribute to them; nevertheless, I would like to mention, in particu¬ 
lar, the works of the Ehrenfests, Fowler, Guggenheim, Schrodinger, Rushbrooke, ter Haar, Hill, 
Landau and Lifshitz, Huang, and Kubo, which have been my constant reference for several years 
and have influenced my understanding of the subject in a variety of ways. As for the preparation 
of the text, I am indebted to Robert Teshima who drew most of the graphs and checked most of 
the problems, to Ravindar Bansal, Vishwa Mittar, and Surjit Singh who went through the entire 
manuscript and made several suggestions that helped me unkink the exposition at a number 
of points, to Mary Annetts who typed the manuscript with exceptional patience, diligence and 
care, and to Fred Hetzel, Jim Briante, and Larry Kry who provided technical help during the 
preparation of the final version. 

As this work progressed I felt increasingly gratified toward Professors F. C. Auluck and 
D. S. Kothari of the University of Delhi with whom I started my career and who initiated me 
into the study of this subject, and toward Professor R. C. Majumdar who took keen interest 
in my work on this and every other project that I have undertaken from time to time. I am 
grateful to Dr. D. ter Haar of the University of Oxford who, as the general editor of this series, 
gave valuable advice on various aspects of the preparation of the manuscript and made several 
useful suggestions toward the improvement of the text. I am thankful to Professors J. W. Leech, 
J. Grindlay, and A. D. Singh Nagi of the University of Waterloo for their interest and hospitality 
that went a long way in making this task a pleasant one. 

The final tribute must go to my wife whose cooperation and understanding, at all stages 
of this project and against all odds, have been simply overwhelming. 


R.K.P. 



Historical Introduction 


Statistical mechanics is a formalism that aims at explaining the physical properties of matter 
in bulk on the basis of the dynamical behavior of its microscopic constituents. The scope of the 
formalism is almost as unlimited as the very range of the natural phenomena, for in principle it 
is applicable to matter in any state whatsoever. It has, in fact, been applied, with considerable 
success, to the study of matter in the solid state, the liquid state, or the gaseous state, mat¬ 
ter composed of several phases and/or several components, matter under extreme conditions 
of density and temperature, matter in equilibrium with radiation (as, for example, in astro¬ 
physics), matter in the form of a biological specimen, and so on. Furthermore, the formalism 
of statistical mechanics enables us to investigate the nonequilibrium states of matter as well as 
the equilibrium states; indeed, these investigations help us to understand the manner in which 
a physical system that happens to be “out of equilibrium” at a given time t approaches a "state 
of equilibrium” as time passes. 

In contrast with the present status of its development, the success of its applications, and 
the breadth of its scope, the beginnings of statistical mechanics were rather modest. Barring 
certain primitive references, such as those of Gassendi, Hooke, and so on, the real work on this 
subject started with the contemplations of Bernoulli (1738), Herapath (1821), and joule (1851) 
who, in their own individual ways, attempted to lay a foundation for the so-called kinetic the¬ 
ory of gases — a discipline that finally turned out to be the forerunner of statistical mechanics. 
The pioneering work of these investigators established the fact that the pressure of a gas arose 
from the motion of its molecules and could, therefore, be computed by considering the dynam¬ 
ical influence of the molecular bombardment on the walls of the container. Thus, Bernoulli 
and Herapath could show that, if temperature remained constant, the pressure P of an ordi¬ 
nary gas was inversely proportional to the volume V of the container (Boyle’s law), and that it 
was essentially independent of the shape of the container. This, of course, involved the explicit 
assumption that, at a given temperature T, the (mean) speed of the molecules was independent 
of both pressure and volume. Bernoulli even attempted to determine the (first-order) correc¬ 
tion to this law, arising from the finite size of the molecules, and showed that the volume V 
appearing in the statement of the law should be replaced by (V — b), where b is the “actual” 
volume of the molecules. 1 

Joule was the first to show that the pressure P was directly proportional to the square of 
the molecular speed c, which he had initially assumed to be the same for all molecules. Kronig 
(1856) went a step further. Introducing the “quasistatistical” assumption that, at any time t, 

'As is well known, this “correction” was correctly evaluated, much later, by van der Waals (1873) who showed that, 
for large V, b is four times the “actual” volume of the molecules; see Problem 1.4. 
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one-sixth of the molecules could be assumed to be flying in each of the six “independent” 
directions, namely +x, —x, +y, —y, +z, and — z, he derived the equation 


where n is the number density of the molecules and m the molecular mass. Kronig, too, 
assumed the molecular speed c to be the same for all molecules; so from (1), he inferred that 
the kinetic energy of the molecules should be directly proportional to the absolute temperature 
of the gas. 

Kronig justified his method in these words: “The path of each molecule must be so irreg¬ 
ular that it will defy all attempts at calculation. However, according to the laws of probability, 
one could assume a completely regular motion in place of a completely irregular one!” It must, 
however, be noted that it is only because of the special form of the summations appearing in the 
calculation of the pressure that Kronig’s argument leads to the same result as the one following 
from more refined models. In other problems, such as the ones involving diffusion, viscosity, or 
heat conduction, this is no longer the case. 

It was at this stage that Clausius entered the field. First of all, in 1857, he derived the 
ideal-gas law under assumptions far less stringent than Kronig’s. He discarded both leading 
assumptions of Kronig and showed that equation (1) was still true; of course, c 2 now became 
the mean square speed of the molecules. In a later paper (1859), Clausius introduced the con¬ 
cept of the mean free path and thus became the first to analyze transport phenomena. It was in 
these studies that he introduced the famous “Stosszahlansatz” — the hypothesis on the number 
of collisions (among the molecules) — which, later on, played a prominent role in the monu¬ 
mental work of Boltzmann. 2 With Clausius, the introduction of the microscopic and statistical 
points of view into the physical theory was definitive, rather than speculative. Accordingly, 
Maxwell, in a popular article entitled “Molecules,” written for the Encyclopedia Britannica, 
referred to Clausius as the “principal founder of the kinetic theory of gases,” while Gibbs, in 
his Clausius obituary notice, called him the “father of statistical mechanics.” 3 

The work of Clausius attracted Maxwell to the field. He made his first appearance with 
the memoir “Illustrations in the dynamical theory of gases” (1860), in which he went much 
farther than his predecessors by deriving his famous law of the “distribution of molecular 
speeds.” Maxwell’s derivation was based on elementary principles of probability and was 
clearly inspired by the Gaussian law of “distribution of random errors.” A derivation based on 
the requirement that “the equilibrium distribution of molecular speeds, once acquired, should 
remain invariant under molecular collisions” appeared in 1867. This led Maxwell to establish 
what is known as Maxwell’s transport equation which, if skilfully used, leads to the same results 
as one gets from the more fundamental equation due to Boltzmann. 4 

Maxwell’s contributions to the subject diminished considerably after his appointment, 
in 1871, as the Cavendish Professor at Cambridge. By that time Boltzmann had already made 
his first strides. In the period 1868-1871 he generalized Maxwell’s distribution law to poly¬ 
atomic gases, also taking into account the presence of external forces, if any; this gave rise 
to the famous Boltzmann factor exp(— fe), where e denotes the total energy of a molecule. 
These investigations also led to the equipartition theorem. Boltzmann further showed that, just 

2 For an excellent review of this and related topics, see Ehrenfest and Ehrenfest (1912). 

3 For further details, refer to Montroll (1963) where an account is also given of the pioneering work of Waterston (1846, 
1892). 

4 This equivalence has been demonstrated in Guggenheim (1960) where the coefficients of viscosity, thermal 
conductivity, and diffusion of a gas of hard spheres have been calculated on the basis of Maxwell’s transport equation. 
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like the original distribution of Maxwell, the generalized distribution (which we now call the 
Maxwell-Boltzmann distribution) is stationary with respect to molecular collisions. 

In 1872 came the celebrated H-theorem, which provided a molecular basis for the natural 
tendency of physical systems to approach, and stay in, a state of equilibrium. This established 
a connection between the microscopic approach (which characterizes statistical mechan¬ 
ics) and the phenomenological approach (which characterized thermodynamics) much more 
transparently than ever before; it also provided a direct method for computing the entropy 
of a given physical system from purely microscopic considerations. As a corollary to the H- 
theorem, Boltzmann showed that the Maxwell-Boltzmann distribution is the only distribution 
that stays invariant under molecular collisions and that any other distribution, under the influ¬ 
ence of molecular collisions, will ultimately go over to a Maxwell-Boltzmann distribution. In 
1876 Boltzmann derived his famous transport equation, which, in the hands of Chapman and 
Enskog (1916-1917), has proved to be an extremely powerful tool for investigating macroscopic 
properties of systems in nonequilibrium states. 

Things, however, proved quite harsh for Boltzmann. His H-theorem, and the consequent 
irreversible behavior of physical systems, came under heavy attack, mainly from Loschmidt 
(1876-1877) and Zermelo (1896). While Loschmidt wondered how the consequences of this 
theorem could be reconciled with the reversible character of the basic equations of motion 
of the molecules, Zermelo wondered how these consequences could be made to fit with the 
quasiperiodic behavior of closed systems (which arose in view of the so-called Poincare cycles). 
Boltzmann defended himself against these attacks with all his might but, unfortunately, could 
not convince his opponents of the correctness of his viewpoint. At the same time, the energeti- 
cists, led by Mach and Ostwald, were criticizing the very (molecular) basis of the kinetic theory, 5 
while Kelvin was emphasizing the “nineteenth-century clouds hovering over the dynamical 
theory of light and heat.” 6 7 

All this left Boltzmann in a state of despair and induced in him a persecution complex.' 
He wrote in the introduction to the second volume of his treatise Vorlesungen liber Gastheorie 
(1898) : 8 

I am convinced that the attacks (on the kinetic theory) rest on misunderstandings and 
that the role of the kinetic theory is not yet played out. In my opinion it would be a blow 
to science if contemporary opposition were to cause kinetic theory to sink into the oblivion 
which was the fate suffered by the wave theory of light through the authority of Newton. 

I am aware of the weakness of one individual against the prevailing currents of opinion. 

In order to insure that not too much will have to be rediscovered when people return to 
the study of kinetic theory I will present the most difficult and misunderstood parts of the 
subject in as clear a manner as I can. 

We shall not dwell any further on the kinetic theory; we would rather move on to the 
development of the more sophisticated approach known as the ensemble theory, which may in 
fact be regarded as the statistical mechanics proper. 9 In this approach, the dynamical state of a 

5 These critics were silenced by Einstein whose work on the Brownian motion (1905b) established atomic theory once 
and for all. 

6 The first of these clouds was concerned with the mysteries of the “aether,” and was dispelled by the theory of relativ¬ 
ity. The second was concerned with the inadequacy of the “equipartition theorem,” and was dispelled by the quantum 
theory. 

7 Some people attribute Boltzmann’s suicide on September 5, 1906 to this cause. 

8 Quotation from Montroll (1963). 

9 For a review of the historical development of kinetic theory leading to statistical mechanics, see Brush (1957, 1958, 
1961a, b, 1965-1966). 
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given system, as characterized by the generalized coordinates qi and the generalized momenta 
Pi, is represented by a phase point G(qi, pp in a phase space of appropriate dimensionality. The 
evolution of the dynamical state in time is depicted by the trajectory of the G-point in the phase 
space, the “geometry” of the trajectory being governed by the equations of motion of the system 
and by the nature of the physical constraints imposed on it. To develop an appropriate formal¬ 
ism, one considers the given system along with an infinitely large number of “mental copies” 
thereof; that is, an ensemble of similar systems under identical physical constraints (though, at 
any time t, the various systems in the ensemble would differ widely in respect of their dynam¬ 
ical states). In the phase space, then, one has a swarm of infinitely many G-points (which, at 
any time t, are widely dispersed and, with time, move along their respective trajectories). The 
fiction of a host of infinitely many, identical but independent, systems allows one to replace 
certain dubious assumptions of the kinetic theory of gases by readily acceptable statements of 
statistical mechanics. The explicit formulation of these statements was first given by Maxwell 
(1879) who on this occasion used the word “statistico-mechanical” to describe the study of 
ensembles (of gaseous systems) — though, eight years earlier, Boltzmann (1871) had already 
worked with essentially the same kind of ensembles. 

The most important quantity in the ensemble theory is the density function, p(qi,pi',t), 
of the G-points in the phase space; a stationary distribution (dp/dt = 0) characterizes a sta¬ 
tionary ensemble, which in turn represents a system in equilibrium. Maxwell and Boltzmann 
confined their study to ensembles for which the function p depended solely on the energy E of 
the system. This included the special case of ergodic systems, which were so defined that “the 
undisturbed motion of such a system, if pursued for an unlimited time, would ultimately tra¬ 
verse (the neighborhood of) each and every phase point compatible with the fixed value E of 
the energy. ” Consequently, the ensemble average, (f), of a physical quantity/, taken at any given 
time t, would be the same as the long-time average,/, pertaining to any given member of the 
ensemble. Now, / is the value we expect to obtain for the quantity in question when we make 
an appropriate measurement on the system; the result of this measurement should, there¬ 
fore, agree with the theoretical estimate (/). We thus acquire a recipe to bring about a direct 
contact between theory and experiment. At the same time, we lay down a rational basis for a 
microscopic theory of matter as an alternative to the empirical approach of thermodynamics! 

A significant advance in this direction was made by Gibbs who, with his Elementary Prin¬ 
ciples of Statistical Mechanics (1902), turned ensemble theory into a most efficient tool for the 
theorist. He emphasized the use of “generalized” ensembles and developed schemes which, in 
principle, enabled one to compute a complete set of thermodynamic quantities of a given sys¬ 
tem from purely mechanical properties of its microscopic constituents. 10 In its methods and 
results, the work of Gibbs turned out to be much more general than any preceding treatment 
of the subject; it applied to any physical system that met the simple-minded requirements 
that (i) it was mechanical in structure and (ii) it obeyed Lagrange’s and Hamilton’s equa¬ 
tions of motion. In this respect, Gibbs’s work may be considered to have accomplished for 
thermodynamics as much as Maxwell’s had accomplished for electrodynamics. 

These developments almost coincided with the great revolution that Planck’s work of 
1900 brought into physics. As is well known, Planck’s quantum hypothesis successfully resolved 
the essential mysteries of the black-body radiation — a subject where the three best-established 
disciplines of the nineteenth century, namely mechanics, electrodynamics, and thermodynam¬ 
ics, were all focused. At the same time, it uncovered both the strengths and the weaknesses 
of these disciplines. It would have been surprising if statistical mechanics, which linked 
thermodynamics with mechanics, could have escaped the repercussions of this revolution. 

10 In much the same way as Gibbs, but quite independently of him, Einstein (1902,1903) also developed the theory of 
ensembles. 
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The subsequent work of Einstein (1905a) on the photoelectric effect and of Compton 
(1923a,b) on the scattering of x-rays established, so to say, the “existence” of the quan¬ 
tum of radiation, or the photon as we now call it. 11 It was then natural for someone to 
derive Planck’s radiation formula by treating black-body radiation as a gas of photons in the 
same way as Maxwell had derived his law of distribution of molecular speeds for a gas of 
conventional molecules. But, then, does a gas of photons differ so radically from a gas of 
conventional molecules that the two laws of distribution should be so different from one 
another? 

The answer to this question was provided by the manner in which Planck’s formula was 
derived by Bose. In his historic paper of 1924, Bose treated black-body radiation as a gas of pho¬ 
tons; however, instead of considering the allocation of the “individual” photons to the various 
energy states of the system, he fixed his attention on the number of states that contained “a par¬ 
ticular number” of photons. Einstein, who seems to have translated Bose’s paper into German 
from an English manuscript sent to him by the author, at once recognized the importance of 
this approach and added the following note to his translation: “Bose’s derivation of Planck’s 
formula is in my opinion an important step forward. The method employed here would also 
yield the quantum theory of an ideal gas, which I propose to demonstrate elsewhere.” 

Implicit in Bose’s approach was the fact that in the case of photons what really mat¬ 
tered was “the set of numbers of photons in various energy states of the system” and not the 
specification as to “which photon was in which state”; in other words, photons were mutu¬ 
ally indistinguishable. Einstein argued that what Bose had implied for photons should be 
true for material particles as well (for the property of indistinguishability arose essentially 
from the wave character of these entities and, according to de Broglie, material particles also 
possessed that character). 12 In two papers, which appeared soon after, Einstein (1924, 1925) 
applied Bose’s method to the study of an ideal gas and thereby developed what we now call 
Bose-Einstein statistics. In the second of these papers, the fundamental difference between 
the new statistics and the classical Maxwell-BoItzmann statistics comes out so transparently 
in terms of the indistinguishability of the molecules. 13 In the same paper, Einstein discovered 
the phenomenon of Bose-Einstein condensation which, 13 years later, was adopted by London 
(1938a,b) as the basis for a microscopic understanding of the curious properties of liquid He 4 
at low temperatures. 

Following the enunciation of Pauli’s exclusion principle (1925), Fermi (1926) showed that 
certain physical systems would obey a different kind of statistics, namely the Fermi-Dirac 
statistics, in which not more than one particle could occupy the same energy state («,■ = 0,1). It 
seems important to mention here that Bose’s method of 1924 leads to the Fermi-Dirac dis¬ 
tribution as well, provided that one limits the occupancy of an energy state to at most one 
particle. 14 


u Strictly speaking, it might be somewhat misleading to cite Einstein’s work on the photoelectric effect as a proof of 
the existence of photons. In fact, many of the effects (including the photoelectric effect), for which it seems necessary 
to invoke photons, can be explained away on the basis of a wave theory of radiation. The only phenomena for which 
photons seem indispensable are the ones involving fluctuations, such as the Hanbury Brown-Twiss effect or the Lamb 
shift. For the relevance of fluctuations to the problem of radiation, see ter Haar (1967, 1968). 

12 Of course, in the case of material particles, the total number N (of the particles) will also have to be conserved; this 
had not to be done in the case of photons. For details, see Section 6.1. 

13 It is here that one encounters the correct method of counting “the number of distinct ways in which g; energy states 
can accommodate «,■ particles,” depending on whether the particles are (i) distinguishable or (ii) indistinguishable. The 
occupancy of the individual states was, in each case, unrestricted, that is, n,- = 0,1,2,.... 

14 Dirac, who was the first to investigate the connection between statistics and wave mechanics, showed, in 1926, that 
the wave functions describing a system of identical particles obeying Bose-Einstein (or Fermi-Dirac) statistics must be 
symmetric (or antisymmetric) with respect to an interchange of two particles. 
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Soon after its appearance, the Fermi-Dirac statistics were applied by Fowler (1926) to 
discuss the equilibrium states of white dwarf stars and by Pauli (1927) to explain the weak, 
temperature-independent paramagnetism of alkali metals; in each case, one had to deal with a 
“highly degenerate” gas of electrons that obey Fermi-Dirac statistics. In the wake of this, Som- 
merfeld produced his monumental work of 1928 that not only put the electron theory of metals 
on a physically secure foundation but also gave it a fresh start in the right direction. Thus, Som- 
merfeld could explain practically all the major properties of metals that arose from conduction 
electrons and, in each case, obtained results that showed much better agreement with exper¬ 
iment than the ones following from the classical theories of Riecke (1898), Drude (1900), and 
Lorentz (1904-1905). Around the same time, Thomas (1927) and Fermi (1928) investigated the 
electron distribution in heavier atoms and obtained theoretical estimates for the relevant bind¬ 
ing energies; these investigations led to the development of the so-called Thomas-Fermi model 
of the atom, which was later extended so that it could be applied to molecules, solids, and nuclei 
as well. 15 

Thus, the whole structure of statistical mechanics was overhauled by the introduction 
of the concept of indistinguishability of (identical) particles. 16 The statistical aspect of the 
problem, which was already there in view of the large number of particles present, was now 
augmented by another statistical aspect that arose from the probabilistic nature of the wave 
mechanical description. One had, therefore, to carry out a two-fold averaging of the dynamical 
variables over the states of the given system in order to obtain the relevant expectation val¬ 
ues. That sort of a situation was bound to necessitate a reformulation of the ensemble theory 
itself, which was carried out step by step. First, Landau (1927) and von Neumann (1927) intro¬ 
duced the so-called density matrix, which was the quantum-mechanical analogue of the density 
function of the classical phase space; this was elaborated, both from statistical and quantum- 
mechanical points of view, by Dirac (1929-1931). Guided by the classical ensemble theory, these 
authors considered both microcanonical and canonical ensembles; the introduction of grand 
canonical ensembles in quantum statistics was made by Pauli (1927). 17 

The important question as to which particles would obey Bose-Einstein statistics and 
which Fermi-Dirac remained theoretically unsettled until Belinfante (1939) and Pauli (1940) 
discovered the vital connection between spin and statistics. 18 ft turns out that those particles 
whose spin is an integral multiple of h obey Bose-Einstein statistics while those whose spin 
is a half-odd integral multiple of h obey Fermi-Dirac statistics. To date, no third category of 
particles has been discovered. 

Apart from the foregoing milestones, several notable contributions toward the devel¬ 
opment of statistical mechanics have been made from time to time; however, most of those 
contributions were concerned with the development or perfection of mathematical techniques 
that make application of the basic formalism to actual physical problems more fruitful. A review 
of these developments is out of place here; they will be discussed at their appropriate place in 
the text. 


15 For an excellent review of this model, see March (1957). 

16 Of course, in many a situation where the wave nature of the particles is not so important, classical statistics continue 
to apply. 

17 A detailed treatment of this development has been given by Kramers (1938). 

18 See also Liiders and Zumino (1958). 


The Statistical Basis 
of Thermodynamics 


In the annals of thermal physics, the 1850s mark a very definite epoch. By that time the 
science of thermodynamics, which grew essentially out of an experimental study of the 
macroscopic behavior of physical systems, had become, through the work of Carnot, foule, 
Clausius, and Kelvin, a secure and stable discipline of physics. The theoretical conclusions 
following from the first two laws of thermodynamics were found to be in very good agree¬ 
ment with the corresponding experimental results. 1 At the same time, the kinetic theory of 
gases, which aimed at explaining the macroscopic behavior of gaseous systems in terms of 
the motion of their molecules and had so far thrived more on speculation than calculation, 
began to emerge as a real, mathematical theory. Its initial successes were glaring; however, 
a real contact with thermodynamics could not be made until about 1872 when Boltzmann 
developed his //-theorem and thereby established a direct connection between entropy on 
one hand and molecular dynamics on the other. Almost simultaneously, the conventional 
(kinetic) theory began giving way to its more sophisticated successor — the ensemble the¬ 
ory. The power of the techniques that finally emerged reduced thermodynamics to the 
status of an “essential” consequence of the get-together of the statistics and the mechan¬ 
ics of the molecules constituting a given physical system. It was then natural to give the 
resulting formalism the name Statistical Mechanics. 

As a preparation toward the development of the formal theory, we start with a few 
general considerations regarding the statistical nature of a macroscopic system. These 
considerations will provide ground for a statistical interpretation of thermodynamics. It 
may be mentioned here that, unless a statement is made to the contrary, the system under 
study is supposed to be in one of its equilibrium states. 


1.1 The macroscopic and the microscopic states 

We consider a physical system composed of N identical particles confined to a space of 
volume V. In a typical case, N would be an extremely large number — generally, of order 
10 23 . In view of this, it is customary to carry out analysis in the so-called thermodynamic 
limit, namely N —>■ oo, V -»• oo (such that the ratio N/V, which represents the particle den¬ 
sity n, stays fixed at a preassigned value). In this limit, the extensive properties of the system 

'The third law, which is also known as Nernst’s heat theorem, did not arrive until about 1906. For a general discussion 
of this law, see Simon (1930) and Wilks (1961); these references also provide an extensive bibliography on this subject. 
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become directly proportional to the size of the system (i.e., proportional to N or to V), 
while the intensive properties become independent thereof; the particle density, of course, 
remains an important parameter for all physical properties of the system. 

Next we consider the total energy E of the system. If the particles comprising the system 
could be regarded as noninteracting, the total energy E would be equal to the sum of the 
energies e z of the individual particles: 


E-'Y^n i si, ( 1 ) 

i 

where ri j denotes the number of particles each with energy e z . Clearly, 

N=J2 n i- (23 

i 

According to quantum mechanics, the single-particle energies e z are discrete and their val¬ 
ues depend crucially on the volume V to which the particles are confined. Accordingly, the 
possible values of the total energy E are also discrete. However, for large V, the spacing of 
the different energy values is so small in comparison with the total energy of the system 
that the parameter E might well be regarded as a continuous variable. This would be true 
even if the particles were mutually interacting; of course, in that case the total energy E 
cannot be written in the form (1). 

The specification of the actual values of the parameters N, V, and E then defines a 
macrostate of the given system. 

At the molecular level, however, a large number of possibilities still exist because at 
that level there will in general be a large number of different ways in which the macrostate 
( N, V, E) of the given system can be realized. In the case of a noninteracting system, since 
the total energy E consists of a simple sum of the N single-particle energies e z , there will 
obviously be a large number of different ways in which the individual e, can be chosen so as 
to make the total energy equal to E. In other words, there will be a large number of different 
ways in which the total energy E of the system can be distributed among the N particles 
constituting it. Each of these (different) ways specifies a microstate, or complexion, of the 
given system. In general, the various microstates, or complexions, of a given system can 
be identified with the independent solutions \jr (ri,..., rjv) of the Schrodinger equation of 
the system, corresponding to the eigenvalue E of the relevant Hamiltonian. In any case, 
to a given macrostate of the system there does in general correspond a large number of 
microstates and it seems natural to assume, when there are no other constraints, that at 
any time t the system is equally likely to be in any one of these microstates. This assump¬ 
tion forms the backbone of our formalism and is generally referred to as the postulate of 
“equal a priori probabilities” for all microstates consistent with a given macrostate. 

The actual number of all possible microstates will, of course, be a function of N, V, 
and E and may be denoted by the symbol Q(N, V,E ); the dependence on V comes in 
because the possible values e z of the single-particle energy e are themselves a function 
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of this parameter. 2 Remarkably enough, it is from the magnitude of the number £2, and 
from its dependence on the parameters N, V, and £, that complete thermodynamics of 
the given system can be derived! 

We shall not stop here to discuss the ways in which the number £2 (N, V, E) can be com¬ 
puted; we shall do that only after we have developed our considerations sufficiently so that 
we can carry out further derivations from it. First we have to discover the manner in which 
this number is related to any of the leading thermodynamic quantities. To do this, we con¬ 
sider the problem of “thermal contact” between two given physical systems, in the hope 
that this consideration will bring out the true nature of the number £2. 


1.2 Contact between statistics and thermodynamics: 
physical significance of the number Q(N, V,E ) 

We consider two physical systems, Ai and A 2 , which are separately in equilibrium; see 
Figure 1.1. Let the macrostate of A\ be represented by the parameters N\, Vf, and E\ so 
that it has £ 2 i(iVi, V\,Ei) possible microstates, and the macrostate of A 2 be represented by 
the parameters A/2, V2, and £2 so that it has £22 (A/2, V2,£2) possible microstates. The math¬ 
ematical form of the function £2i may not be the same as that of the function Q 2 , because 
that ultimately depends on the nature of the system. We do, of course, believe that all 
thermodynamic properties of the systems A\ and A 2 can be derived from the functions 
S 2 i(iVi, V\,Ei) and £2 2 (A/2, V2, £2), respectively. 

We now bring the two systems into thermal contact with each other, thus allowing the 
possibility of exchange of energy between the two; this can be done by sliding in a con¬ 
ducting wall and removing the impervious one. For simplicity, the two systems are still 
separated by a rigid, impenetrable wall, so that the respective volumes Vi and V 2 and the 
respective particle numbers N\ and Afc remain fixed. The energies £1 and £2, however, 
become variable and the only condition that restricts their variation is 

E <0> =E\+E 2 = const. (1) 




a 2 



(A/i, V„E,) 

(N 2 , v 2 , E 2 ) 



FIGURE 1.1 Two physical systems being brought into thermal contact. 

2 It may be noted that the manner in which the s; depend on V is itself determined by the nature of the system. For 
instance, it is not the same for relativistic systems as it is for nonrelativistic ones; compare, for instance, the cases dealt 
with in Section 1.4 and in Problem 1.7. We should also note that, in principle, the dependence of ft on V arises from 
the fact that it is the physical dimensions of the container that appear in the boundary conditions imposed on the wave 
functions of the system. 
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Here, E i0) denotes the energy of the composite system A (0) (= A\ +A 2 ); the energy of inter¬ 
action beLvveen hi and A 2 , if any, is being neglected. Now, at anytime f, the subsystem A\ is 
equally likely to be in any one of the Q] (E{) microstates while the subsystem A 2 is equally 
likely to be in any one of the El 2 (E 2 ) microstates; therefore, the composite system A a)) is 
equally likely to be in any one of the 

ni(Bi)n 2 (£2) = fii(Ei)fi 2 (E (0) -Ei) = ^ (0, (£ (0) ,£i) (2) 

microstates. 3 Clearly, the number Q a)> itself varies with E\. The question now arises: at 
what value of E\ will the composite system be in equilibrium? In other words, how far 
will the energy exchange go in order to bring the subsystems A\ and A 2 into mutual 
equilibrium? 

We assert that this will happen at that value of E\ which maximizes the number 
Q i(y> (E ity> ,E\ ). The philosophy behind this assertion is that a physical system, left to itself, 
proceeds naturally in a direction that enables it to assume an ever-increasing number 
of microstates until it finally settles down in a macrostate that affords the largest pos¬ 
sible number of microstates. Statistically speaking, we regard a macrostate with a larger 
number of microstates as a more probable state, and the one with the largest number of 
microstates as the most probable one. Detailed studies show that, for a typical system, 
the number of microstates pertaining to any macrostate that departs even slightly from 
the most probable one is “orders of magnitude” smaller than the number pertaining to 
the latter. Thus, the most probable state of a system is the macrostate in which the system 
spends an “overwhelmingly” large fraction of its time. It is then natural to identify this state 
with the equilibrium state of the system. 

Denoting the equilibrium value of Ei by E\ (and that of E 2 by E 2 ), we obtain, on 
maximizing tT 01 , 


/ 3fll(£l) \ 

V 9-Ei ) Ei=Ei 


n 2 (E 2 ) + th(£i) 


dQ.2(E 2 ) \ dE 2 

dE 2 )e 2 =e 2 dE i 


Since dE 2 /dE\ = — 1, see equation (1), the foregoing condition can be written as 

/ 9 In Q] (E \) \ _ / 3ln£2 2 (£ 2 ) \ 

V dEi ) e 1 =e 1 \ d E 2 ) e 2 =e 2 


Thus, our condition for equilibrium reduces to the equality of the parameters and ft 2 
of the subsystems Ai and A 2 , respectively, where fi is defined by 


/ 3 In £2 (IV, V,E) 
V 3 E 


) N,V,E=E 


(3) 


3 It is obvious that the macrostate of the composite system A <0) has to be defined by two energies, namely Ei and E 2 
(or else f? (0) and Ei). 
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We thus find that when two physical systems are brought into thermal contact, which 
allows an exchange of energy between them, this exchange continues until the equilibrium 
values E\ and £2 of the variables £1 and £2 are reached. Once these values are reached, 
there is no more net exchange of energy between the two systems; the systems are then 
said to have attained a state of thermal equilibrium. According to our analysis, this hap¬ 
pens only when the respective values of the parameter ft, namely fti and ft 2 , become 
equal. 4 It is then natural to expect that the parameter ft is somehow related to the ther¬ 
modynamic temperature T of a given system. To determine this relationship, we recall the 
thermodynamic formula 



where S is the entropy of the system in question. Comparing equations (3) and (4), we 
conclude that an intimate relationship exists between the thermodynamic quantity S and 
the statistical quantity £2; we may, in fact, write for any physical system 

AS 1 

—--= — = const. (5) 

A (In £2) ftT 

This correspondence was first established by Boltzmann who also believed that, since 
the relationship between the thermodynamic approach and the statistical approach seems 
to be of a fundamental character, the constant appearing in (5) must be a universal 
constant. It was Planck who first wrote the explicit formula 

S=fcln£2, (6) 

without any additive constant So- As it stands, formula (6) determines the absolute value of 
the entropy of a given physical system in terms of the total number of microstates acces¬ 
sible to it in conformity with the given macrostate. The zero of entropy then corresponds 
to the special state for which only one microstate is accessible (£2 = 1) — the so-called 
“unique configuration”; the statistical approach thus provides a theoretical basis for the 
third law of thermodynamics as well. Formula (6) is of fundamental importance in physics; 
it provides a bridge between the microscopic and the macroscopic. 

Now, in the study of the second law of thermodynamics we are told that the law of 
increase of entropy is related to the fact that the energy content of the universe, in its 
natural course, is becoming less and less available for conversion into work; accordingly, 
the entropy of a given system may be regarded as a measure of the so-called disorder or 
chaos prevailing in the system. Formula (6) tells us how disorder arises microscopically. 
Clearly, disorder is a manifestation of the largeness of the number of microstates the sys¬ 
tem can have. The larger the choice of microstates, the lesser the degree of predictability 
and hence the increased level of disorder in the system. Complete order prevails when and 

4 This result may be compared with the so-called ‘‘zeroth law of thermodynamics,” which stipulates the existence of 
a common parameter T for two or more physical systems in thermal equilibrium. 
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only when the system has no other choice but to be in a unique state (£2 = 1); this, in turn, 
corresponds to a state of vanishing entropy. 

By equations (5) and (6), we also have 


p = w (7) 

The universal constant k is generally referred to as the Boltzmann constant. In Section 1.4 
we shall discover how k is related to the gas constant R and the Avogadro number Na) see 
equation (1.4.3). 5 


1.3 Further contact between statistics 
and thermodynamics 

In continuation of the preceding considerations, we now examine a more elaborate 
exchange between the subsystems A\ and A 2 . If we assume that the wall separating the 
two subsystems is movable as well as conducting, then the respective volumes Vf and V 2 
(of subsystems A\ and A 2 ) also become variable; indeed, the total volume V {{]) (— Vf + V 2 ) 
remains constant, so that effectively we have only one more independent variable. Of 
course, the wall is still assumed to be impenetrable to particles, so the numbers N\ and 
N 2 remain fixed. Arguing as before, the state of equilibrium for the composite system A (0) 
will obtain when the number Q^(V (0) ,E (0 ^;Vi,Ei) attains its largest value; that is, when 
not only 

fSMA /aJnOA j (la) 

V 3£i /Ni,Vi;Bi=Ei V aE 2 )N 2 ,V 2 ] E 2 =B 2 

but also 


(lb , 

V /Ni.Er.Vi = Vi \ dV 2 /n 2 ,e 2 ; v 2 =v 2 

Our conditions for equilibrium now take the form of an equality between the pair of 
parameters (/h,?n) of the subsystem A\ and the parameters C/S 2 , 112 ) of the subsystem A 2 
where, by definition, 


ri = 


(dlnQ(N, V,E) 
\ dv 


N,E,V=V 


( 2 ) 


Similarly, if Ai and A 2 came into contact through a wall that allowed an exchange of parti¬ 
cles as well, the conditions for equilibrium would be further augmented by the equality 

5 We follow the notation whereby equation (1.4.3) means equation (3) of Section 1.4. However, while referring to an 
equation in the same section, we will omit the mention of the section number. 
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of the parameter of subsystem Ai and the parameter £2 of subsystem A 2 where, by 
definition, 


/ 9 In S2 (iV, V,E) 
v dN 


V,E,N=N 


(3) 


To determine the physical meaning of the parameters i] and we make use of equa¬ 
tion (1.2.6) and the basic formula of thermodynamics, namely 


dE= TdS — PdV + n dN, 


(4) 


where P is the thermodynamic pressure and /i the chemical potential of the given system. 
It follows that 


P , u 

11 = — and t =-. (5) 

1 lcT s kT 

From a physical point of view, these results are completely satisfactory because, thermo¬ 
dynamically as well, the conditions of equilibrium between two systems A± and A 2 , if the 
wall separating them is both conducting and movable (thus making their respective ener¬ 
gies and volumes variable), are indeed the same as the ones contained in equations (la) 
and (lb), namely 


Ti = T 2 and P\=P 2 . (6) 

On the other hand, if the two systems can exchange particles as well as energy but 
have their volumes fixed, the conditions of equilibrium, obtained thermodynamically, are 
indeed 


T\ = T 2 and mi = M 2 - (7) 

And finally, if the exchange is such that all three (macroscopic) parameters become 
variable, then the conditions of equilibrium become 

T] — T 2i P] = P 2l and Ml—M 2 - (8) 6 

It is gratifying that these conclusions are identical to the ones following from statistical 
considerations. 

Combining the results of the foregoing discussion, we arrive at the following recipe 
for deriving thermodynamics from a statistical beginning: determine, for the macrostate 
(N, V, E) of the given system, the number of all possible microstates accessible to the sys¬ 
tem; call this number Q (N, V, E). Then, the entropy of the system in that state follows from 

6 It may be noted that the same would be true for any two parts of a single thermodynamic system; consequently, in 
equilibrium, the parameters T,P, and u would be constant throughout the system. 
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the fundamental formula 


S(N, V,E) = klnQ(N, V,E), (9) 

while the leading intensive fields, namely temperature, pressure, and chemical potential, 
are given by 


/as\ _ i /as\ _ p /as\ _ m 

{dEj N y-f ; W) N:E ~f ; \91V / VE ~ ~T 

Alternatively, we can write 7 


\3VJn,e/ \dE/N,V JN,S 

and 

--(SU(S),g(SL 

while 


( 10 ) 


( 11 ) 


( 12 ) 


T = 



(13) 


Formulae (11) through (13) follow equally well from equation (4). The evaluation of P,p, 
and T from these formulae indeed requires that the energy E be expressed as a function 
of the quantities N, V, and S; this should, in principle, be possible once S is known as a 
function of N, V, and E. 

The rest of the thermodynamics follows straightforwardly; see Appendix H. For 
instance, the Flelmholtz free energy A, the Gibbs free energy G, and the enthalpy H are 
given by 


A = E — TS, (14) 

G = A + PV = E- TS + PV 

= /hV (15) 8 


7 In writing these formulae, we have made use of the well-known relationship in partial differential calculus, namely 
that “if three variables x, y, and z are mutually related, then (see Appendix H) 

(H) Z (S) X (H) J ,=- 1 -' 

s The relation E— TS + PV = /iN follows directly from (4). For this, all we have to do is to regard the given system 
as having grown to its present size in a gradual manner, such that the intensive parameters, T,P, and /r stayed constant 
throughout the process while the extensive parameters N, V, and E (and hence S) grew proportionately with one another. 
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and 


H = E + PV=G+TS. 


(16) 


The specific heat at constant volume, Cy, and the one at constant pressure, Cp, would be 
given by 


and 


C V =T 


dS\ 

dT /N,V 



N,V 


O-r(H) _(M) .(f) . 

/ N,P \ / N,P \°T/n,P 


(17) 


(18) 


1.4 The classical ideal gas 

To illustrate the approach developed in the preceding sections, we shall now derive 
the various thermodynamic properties of a classical ideal gas composed of monatomic 
molecules. The main reason why we choose this highly specialized system for considera¬ 
tion is that it affords an explicit, though asymptotic, evaluation of the number Q(N, V, E). 
This example becomes all the more instructive when we find that its study enables us, 
in a most straightforward manner, to identify the Boltzmann constant k in terms of 
other physical constants; see equation (3). Moreover, the behavior of this system serves 
as a useful reference with which the behavior of other physical systems, especially real 
gases (with or without quantum effects), can be compared. And, indeed, in the limit of 
high temperatures and low densities the ideal-gas behavior becomes typical of most real 
systems. 

Before undertaking a detailed study of this case it appears worthwhile to make a remark 
that applies to all classical systems composed of noninteracting particles, irrespective 
of the internal structure of the particles. This remark is related to the explicit dependence 
of the number Q (N, V, E) on V and hence to the equation of state of these systems. Now, 
if there do not exist any spatial correlations among the particles, that is, if the probability 
of any one of them being found in a particular region of the available space is completely 
independent of the location of the other particles, 9 then the total number of ways in which 
the N particles can be spatially distributed in the system will be simply equal to the prod¬ 
uct of the numbers of ways in which the individual particles can be accommodated in the 
same space independently of one another. With N and E fixed, each of these numbers will 
be directly proportional to V, the volume of the container; accordingly, the total number 
of ways will be directly proportional to the Nth power of V: 

Q(N,E, V) a V N . (1) 


9 This will be true if (i) the mutual interactions among particles are negligible, and (ii) the wave packets of individual 
particles do not significantly overlap (or, in other words, the quantum effects are also negligible). 



10 Chapter 1 • The Statistical Basis of Thermodynamics 


Combined with equations (1.3.9) and (1.3.10), this gives 

P = N 

T \ 9V ) N}E V 

If the system contains n moles of the gas, then N — nN A , where N,\ is the Avogadro number. 
Equation (2) then becomes 


PV = NkT = nRT (R = kN A ), 


(3) 


which is the famous ideal-gas law, R being the gas constant per mole. Thus, for any 
classical system composed of noninteracting particles the ideal-gas law holds. 

For deriving other thermodynamic properties of this system, we require a detailed 
knowledge of the way depends on the parameters N, V, and E. The problem essen¬ 
tially reduces to determining the total number of ways in which equations (1.1.1) and 
(1.1.2) can be mutually satisfied. In other words, we have to determine the total number of 
(independent) ways of satisfying the equation 

3N 

J2 S r = E ’ 

r= 1 


where e r are the energies associated with the various degrees of freedom of the N par¬ 
ticles. The reason why this number should depend on the parameters N and E is quite 
obvious. Nevertheless, this number also depends on the “spectrum of values’’ that the vari¬ 
ables Sy can assume; it is through this spectrum that the dependence on V comes in. Now, 
the energy eigenvalues for a free, nonrelativistic particle confined to a cubical box of side 
L (V = L 3 ), under the condition that the wave function fir) vanishes everywhere on the 
boundary, are given by 


e(n x ,n y ,n z ) 


h 2 

8 mL 2 


(n 2 x + n 2 + 


n l)’ 


n x ,n y ,n z = 1,2,3,..., 


(5) 


where h is Planck’s constant and m the mass of the particle. The number of distinct 
eigenfunctions (or microstates) for a particle of energy s would, therefore, be equal to the 
number of independent, positive-integral solutions of the equation 


[n 2 x + n 2 + nfj 


8mV 2 l 3 e 

P 


( 6 ) 


We may denote this number by fi(l,e, V). Extending the argument, it follows that the 
desired number Q(N,E, V) would be equal to the number of independent, positive- 
integral solutions of the equation 


3 N 


J2- 2 r 


8 mV 2 l 3 E 
h 2 


say. 


(7) 
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An important result follows straightforwardly from equation (7), even before the number 
Q ( N, E, V) is explicitly evaluated. From the nature of the expression appearing on the right 
side of this equation, we conclude that the volume V and the energy E of the system enter 
into the expression for in the form of the combination (V 2/: ''E). Consequently, 

S(N,V,E) = s(n, V 2/3 E^J. (8) 

Hence, for the constancy of S and N, which defines a reversible adiabatic process, 


V 2 l 3 E = const. 


0 ) 


Equation (1.3.11) then gives 


/ dE\ _2E 


( 10 ) 


that is, the pressure of a system of nonrelativistic, noninteracting particles is precisely 
equal to two-thirds of its energy density. 10 It should be noted here that, since an explicit 
computation of the number has not yet been done, results (9) and (10) hold for quan¬ 
tum as well as classical statistics; equally general is the result obtained by combining these, 
namely 


PV 5/3 = const., 


( 11 ) 


which tells us how P varies with V during a reversible adiabatic process. 

We shall now attempt to evaluate the number Q. In this evaluation we shall explicitly 
assume the particles to be distinguishable, so that if a particle in state i gets interchanged 
with a particle in state j the resulting microstate is counted as distinct. Consequently, the 
number £2 (Ai, V, E), or better CyfE*) (see equation (7)), is equal to the number of positive- 
integral lattice points lying on the surface of a 3 N -dimensional sphere of radius JE *. 11 
Clearly, this number will be an extremely irregular function of E*, in that for two given 
values of E* that may be very close to one another, the values of this number could be very 
different. In contrast, the number E/vOE*), which denotes the number of positive-integral 
lattice points lying on or within the surface of a 31V-dimensional sphere of radius *]E*, 
will be much less irregular. In terms of our physical problem, this would correspond to 
the number, E (N, V, E), of microstates of the given system consistent with all macrostates 
characterized by the specified values of the parameters N and V but having energy less 

'“Combining (10) with (2), we obtain for the classical ideal gas: E = \NkT. Accordingly, equation (9) reduces to the 
well-known thermodynamic relationship: V y ~ 1 T = const., which holds during a reversible adiabatic process, with y = I. 

11 If the particles are regarded as indistinguishable, the evaluation of the number £2 by counting lattice points becomes 
quite intricate. The problem is then solved by having recourse to the theory of “partitions of numbers”; see Auluck and 
Kothari (1946). 
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than or equal to E; that is, 


£(AT, V,E) = ^ Q,(N,V,E') 

e'<e 


( 12 ) 


or 


Xn(E*)= J2 q n(E*'). ( 13 ) 

£*'<£* 

Of course, the number £ will also be somewhat irregular; however, we expect that its 
asymptotic behavior, as E* oo, will be a lot smoother than that of Q. We shall see in 
the sequel that the thermodynamics of the system follows equally well from the number £ 
as from O. 

To appreciate the point made here, let us digress a little to examine the behavior of 
the numbers Oi(e*) and £i(e*), which correspond to the case of a single particle con¬ 
fined to the given volume V. The exact values of these numbers, for e* < 10,000, can be 
extracted from a table compiled by Gupta (1947). The wild irregularities of the number 
S2i(e*) can hardly be missed. The number £i(e*), on the other hand, exhibits a much 
smoother asymptotic behavior. From the geometry of the problem, we note that, asymp¬ 
totically, £i(e*) should be equal to the volume of an octant of a three-dimensional sphere 
of radius Jr*, that is, 


lim 


Ei(e*) 


= 1. 


e* —^oo (7r/6)g* 3 / 2 

A more detailed analysis shows that, to the next approximation (see Pathria, 1966), 

c *3/2 _ ; 


£l(£*b 


6 


8 


( 14 ) 


( 15 ) 


the correction term arises from the fact that the volume of an octant somewhat overes¬ 
timates the number of desired lattice points, for it includes, partly though, some points 
with one or more coordinates equal to zero. Figure 1.2 shows a histogram of the actual val¬ 
ues of £i(e*) for s* lying between 200 and 300; the theoretical estimate (15) is also shown. 
In the figure, we have also included a histogram of the actual values of the corresponding 
number of microstates, £', (e*), when the quantum numbers n x , n y , and n z can assume the 
value zero as well. In the latter case, the volume of an octant somewhat underestimates the 
number of desired lattice points; we now have 


Si(e*) 


^ *3/2 

6 


+ 



( 16 ) 


Asymptotically, however, the number £^(e*) also satisfies equation (14). 

Returning to the Ai-particle problem, the number T,n(E*) should be asymptotically 
equal to the “volume” of the “positive compartment” of a 3A r -dimensional sphere of 
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FIGURE 1.2 Histograms showing the actual number of microstates available to a particle in a cubical enclosure; the 
lower histogram corresponds to the so-called Dirichlet boundary conditions, while the upper one corresponds to 
the Neumann boundary conditions (see Appendix A). The corresponding theoretical estimates, (15) and (16), are 
shown by dashed lines; the customary estimate, equation (14), is shown by a solid line. 


radius ~JE*. Referring to equation (C.7a) of Appendix C, we obtain 


'i\3JV| JJ.3N/2 

,2/ |(3JV/2)! 


which, on substitution for E*, gives 


E(AT, V,E) 


/ V\ N (2nmE) m / 2 
\¥) (3JV/2)! 


Taking logarithms and applying Stirling’s formula, (B.29) in Appendix B, 


(17) 


ln(«!) « nlnn— n («» 1), 


(18) 
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we get 


In £ (TV, V,E) 




(19) 


For deriving the thermodynamic properties of the given system we must somehow fix 
the precise value of, or limits for, the energy of the system. In view of the extremely irreg¬ 
ular nature of the function Q(N, V,E), the specification of a precise value for the energy 
of the system cannot be justified on physical grounds, for that would never yield well- 
behaved expressions for the thermodynamic functions of the system. From a practical 
point of view, too, an absolutely isolated system is too much of an idealization. In the real 
world, almost every system has some contact with its surroundings, however little it may 
be; as a result, its energy cannot be defined sharply. 12 Of course, the effective width of the 
range over which the energy may vary would, in general, be small in comparison with the 
mean value of the energy. Let us specify this range by the limits (e~ ^ and (e + \ A^ 
where, by assumption, A <^£; typically, A/E = 0(l/^/N). The corresponding number of 
microstates, F(N, V,E; A), is then given by 


r(AT, V,E; A) 


di:(N,V,E) 3N A 

— a ^- j:(N,v,E), 

a E 2 E 


(17a) 


which gives 


Inf (AT, V,E\ A) 


AT In 


V 

h 3 


4nmE 

3N 


3/2' 


4 N+ 



(19a) 


Now, for N^l, the first term in the curly bracket is negligible in comparison with any 
of the terms outside this bracket, for lim ( lnN)/N — 0. Furthermore, for any reasonable 

N^-OO 

value of A /E, the same is true of the second term in this bracket. 13 Lienee, for all practical 
purposes, 


lnf^lnE 


AT In 


r ^ 

( An mE \ 3//2 

h 3 

V 3/V ) 


+ V 


( 20 ) 


We thus arrive at the baffling result that, for all practical purposes, the actual width of the 
range allowed for the energy of the system does not make much difference; the energy 
could lie between (e — ^ A^) and (e+ ^A^ or equally well between 0 and E. The reason 
underlying this situation is that the rate at which the number of microstates of the system 


12 Actually, the very act of making measurements on a system brings about, inevitably, a contact between the system 
and its surroundings. 

13 It should be clear that, while A/E is much less than 1, it must not tend to 0, for that would make F -> 0 and Inf -> 
—oo. A situation of that kind would be too artificial and would have nothing to do with reality. Actually, in most physical 
systems, A /E= 0(N~ 1 / 2 ), whereby ln(A/£) becomes of order lnAT, which again is negligible in comparison with the 
terms outside the curly bracket. 
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increases with energy is so fantastic, see equation (17), that even if we allow all values of 
energy between zero and a particular value E, it is only the “immediate neighborhood" of 
E that makes an overwhelmingly dominant contribution to this number! And since we 
are finally concerned only with the logarithm of this number, even the “width” of that 
neighborhood is inconsequential! 

The stage is now set for deriving the thermodynamics of our system. First of all, we 
have 


S(N, V,E) = klnf = Nkln 
which can be inverted to give 



(4nmE \ 3/2 " 


l 3N ) 


+l m - 


( 21 ) 


14 


3 h 2 N ( 2S 

E(S, V,N) =-— exp -- - 1 

AjtmV 2 ^ 1 \ 3Nk 


( 22 ) 


The temperature of the gas then follows with the help of formula (1.3.10) or (1.3.13), 
which leads to the energy-temperature relationship 


b =KH 



(23) 


where n is the number of moles of the gas. The specific heat at constant volume now 
follows with the help of formula (1.3.17): 


Cy = 




(24) 


For the equation of state, we obtain 


/ dE\ _2E 

P -\W) N ' S -3V’ 

which agrees with our earlier result (10). Combined with (23), this gives 

P = or PV — nRT, 


(25) 


(26) 


which is the same as (3). The specific heat at constant pressure is given by, see (1.3.18), 


C P = 


/ d(E + PV) \ 

\ 3 T ) NP 



(27) 


14 Henceforth, we shall replace the sign which characterizes the asymptotic character of a relationship, by the sign 
of equality because for most physical systems the asymptotic results are as good as exact. 
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so that, for the ratio of the two specific heats, we have 

y = C P /Cv=|. (28) 

Now, suppose that the gas undergoes an isothermal change of state (T = const, and 
N — const.); then, according to (23), the total energy of the gas would remain constant 
while, according to (26), its pressure would vary inversely with volume (Boyle’s law). The 
change in the entropy of the gas, between the initial state i and the final state/, would then 
be, see equation (21), 


S f -Si = Nk\n(V f /Vi). (29) 

On the other hand, if the gas undergoes a reversible adiabatic change of state (S = const, 
and N — const.), then, according to (22) and (23), both E and T would vary as V~ 2 ^\ so, 
according to (25) or (26), P would vary as V~ 5 / 3 . These results agree with the conventional 
thermodynamic ones, namely 

PV V = const, and TV Y ~ l — const., (30) 

with y = |. It may be noted that, thermodynamically, the change in E during an adiabatic 
process arises solely from the external work done by the gas on the surroundings or vice 
versa: 

2 e 

(dE) adiah = -PdV=-—dV; (31) 

see equations (1.3.4) and (25). The dependence of E on V follows readily from this 
relationship. 

The considerations of this section have clearly demonstrated the manner in which 
the thermodynamics of a macroscopic system can be derived from the multiplicity of its 
microstates (as represented by the number O or T or £). The whole problem then hinges 
on an asymptotic enumeration of these numbers, which unfortunately is tractable only 
in a few idealized cases, such as the one considered in this section; see also Problems 1.7 
and 1.8. Even in an idealized case like this, there remains an inadequacy that could not be 
detected in the derivations made so far; this relates to the explicit dependence of S on N. 
The discussion of the next section is intended not only to bring out this inadequacy but 
also to provide the necessary remedy for it. 


1.5 The entropy of mixing and the Gibbs 
paradox 

One thing we readily observe from expression (1.4.21) is that, contrary to what is logi¬ 
cally desired, the entropy of an ideal gas, as given by this expression, is not an extensive 
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(N u Vi; T) 


(N z , V 2 \ T) 


FIGURE 1.3 The mixing together of two ideal gases 1 and 2. 


property of the system! That is, if we increase the size of the system by a factor a, keep¬ 
ing the intensive variables unchanged, 15 then the entropy of the system, which should 
also increase by the same factor a, does not do so; the presence of the In V term in the 
expression affects the result adversely. This in a way means that the entropy of this system 
is different from the sum of the entropies of its parts, which is quite unphysical. A more 
common way of looking at this problem is to consider the so-called Gibbs paradox. 

Gibbs visualized the mixing of two ideal gases 1 and 2, both being initially at the same 
temperature T; see Figure 1.3. Clearly, the temperature of the mixture would also be the 
same. Now, before the mixing took place, the respective entropies of the two gases were, 
see equations (1.4.21) and (1.4.23), 


3 I (2nmikT\ ] 

S i = N t k\nV i +-N i k l + ln( j ; z = 1,2. (1) 

After the mixing has taken place, the total entropy would be 

2 

s t =J2 

1=1 

where V — V\ + V 2 . Thus, the net increase in the value of S, which maybe called the entropy 
of mixing, is given by 


,, 3,1 , / 2 nmikT 

N{k In V + - Nik ! 1 + In ( —p— 


( 2 ) 


2 

(AS) = Sj — T> = k 
i= 1 


Ni In 


Vi + Vz 

Vi 


+ A^ln 


Vi + V 2 \ 

V 2 J’ 


(3) 


the quantity AS is indeed positive, as it must be for an irreversible process like mixing. 
Now, in the special case when the initial particle densities of the two gases (and, hence, the 
particle density of the mixture) are also the same, equation (3) becomes 


(AS)* = lc 


N\ In 


Ni+N 2 

Ni 


+ N2ln 


Ni+N 2 

N 2 


(4) 


which is again positive. 

15 This means an increase of the parameters N, V, and E to aN, a V, and aE, so that the energy per particle and the 
volume per particle remain unchanged. 
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So far, it seems all right. However, a paradoxical situation arises if we consider the mix¬ 
ing of two samples of the same gas. Once again, the entropies of the individual samples 
will be given by (1); of course, now mi = m 2 = m, say. And the entropy after mixing will be 
given by 



(2a) 


where N = Ni + Afc; note that this expression is numerically the same as (2), with m ( - = m. 
Therefore, the entropy of mixing in this case will also be given by expression (3) and, if 
N 1 /V 1 = N 2 /V 2 = (Aii + AfeVfVi + V 2 ), by expression (4). The last conclusion, however, is 
unacceptable because the mixing of two samples of the same gas, with a common initial 
temperature T and a common initial particle density n, is clearly a reversible process, for 
we can simply reinsert the partitioning wall into the system and obtain a situation that is 
in no way different from the one we had before mixing. Of course, we tacitly imply that 
in dealing with a system of identical particles we cannot track them down individually; 
all we can reckon with is their numbers. When two dissimilar gases, even with a common 
initial temperature T, and a common initial particle density n, mixed together the process 
was irreversible, for by reinserting the partitioning wall one would obtain two samples of 
the mixture and not the two gases that were originally present; to that case, expression (4) 
would indeed apply. However, in the present case, the corresponding result should be 


(4a) 16 


(AS) 1=2 - 0- 


The foregoing result would also be consistent with the requirement that the entropy of a 
given system is equal to the sum of the entropies of its parts. Of course, we had already 
noticed that this is not ensured by expression (1.4.21). Thus, once again we are led to 
believe that there is something basically wrong with that expression. 

To see how the above paradoxical situation can be avoided, we recall that, for the 
entropy of mixing of two samples of the same gas, with a common T and a common n, 
we were led to result (4), which can also be written as 


(AS)* = S r - (Si + S 2 ) « fc[ln{(lVi + AT 2 )!) - ln(N x \) - ln(JV 2 !)] 


(4) 


instead of the logical result (4a). A closer look at this expression shows that we would 
indeed obtain the correct result if our original expression for S were diminished by an 
ad hoc term, k\n(N\), for that would diminish Si by kln(Ah!),S 2 by fcln(A( 2 !) and St by 
kln{(Ni + N 2 )\}, with the result that (AS)* would turn out to be zero instead of the expres¬ 
sion appearing in (4). Clearly, this would amount to an ad hoc reduction of the statistical 
numbers T and E by a factor N\. This is precisely the remedy proposed by Gibbs to avoid 
the paradox in question. 

16 In view of this, we fear that expression (3) may also be inapplicable to this case. 
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If we agree with the foregoing suggestion, then the modified expression for the entropy 
of a classical ideal gas would be 


S(N, V,E) = Nk In 


= Nk1n 


V / 4nmE\ 3/2 

Nh 2 V 3 N J 


+ -Nk 
2 


+ -Nkl - + ln 


( 2nmkT\ 
\~hT- )(' 


(1.4.21a) 


(la) 


which indeed is truly extensive! If we now mix two samples of the same gas at a common 
initial temperature T, the entropy of mixing would be 


(AS) 1=2 = k 


(N\ + A^fln 


m 

Ni+N 2 J 


— N2ln 


(3a) 


and, if the initial particle densities of the samples were also equal, the result would be 


(AS) ls2 — 0 . 


(4a) 


It may be noted that for the mixing of two dissimilar gases, the original expressions (3) and 
(4) would continue to hold even when (1.4.21) is replaced by (1.4.21a). 17 The paradox of 
Gibbs is thereby resolved. 

Equation (la) is generally referred to as the Sackur-Tetrode equation. We reiterate the 
fact that, by this equation, the entropy of the system does indeed become a truly extensive 
quantity. Thus, the very root of the trouble has been eliminated by the recipe of Gibbs. We 
shall discuss the physical implications of this recipe in Section 1.6; here, let us jot down 
some of its immediate consequences. 

First of all, we note that the expression for the energy E of the gas, written as a function 
of N, V, and S, is also modified. We now have 


E(N, V,S) 


3h 2 N 5 / 3 / 2 S 5 \ 
4;r mV 2 / 3 eXP \3JVfc “ 3 )’ 


(1.4.22a) 


which, unlike its predecessor (1.4.22), makes energy too a truly extensive quantity. Of 
course, the thermodynamic results (1.4.23) through (1.4.31), derived in the previous 
section, remain unchanged. However, there are some that were intentionally left out, for 
they would come out correct only from the modified expression for S(N, V,E) or E(S, V,N). 
The most important of these is the chemical potential of the gas, for which we obtain 



E 

' 5 

2 S " 

\3 N / 

K,s 

_3N 

3N 2 k_ 


17 Because, in this case, the entropy St of the mixture would be diminished by i;ln(lVi!]V 2 !), rather than by 
fcln((JVi +JV 2 )!). 
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In view of equations (1.4.23) and (1.4.25), this becomes 



(6) 


where G is the Gibbs free energy of the system. In terms of the variables N, V, and T, 
expression (5) takes the form 



(7) 


Another quantity of importance is the Helmholtz free energy: 



( 8 ) 


It will be noted that, while A is an extensive property of the system, /i is intensive. 


1.6 The "correct" enumeration of the microstates 


In the preceding section we saw that an ad hoc diminution in the entropy of an iV-particle 
system by an amount kln(N\), which implies an ad hoc reduction in the number of 
microstates accessible to the system by a factor (IV!), was able to correct the unphysical fea¬ 
tures of some of our former expressions. It is now natural to ask: why, in principle, should 
the number of microstates, computed in Section 1.4, be reduced in this manner? The phys¬ 
ical reason for doing so is that the particles constituting the given system are not only 
identical but also indistinguishable ; accordingly, it is unphysical to label them as No. 1, 
No. 2, No. 3, and so on and to speak of their being individually in the various single-particle 
states All we can sensibly speak of is their distribution over the states s,- by numbers, that 
is, Hi particles being in the state ei, n 2 in the state s 2 , and so on. Thus, the correct way of 
specifying a microstate of the system is through the distribution numbers {nj}, and not 
through the statement as to “which particle is in which state.” To elaborate the point, we 
may say that if we consider two microstates that differ from one another merely in an inter¬ 
change of two particles in different energy states, then according to our original mode of 
counting we would regard these microstates as distinct; in view of the indistinguishability 
of the particles, however, these microstates are not distinct (for, physically, there exists no 
way whatsoever of distinguishing between them). 18 

18 Of course, if an interchange took place among particles in the same energy state, then even our original mode of 
counting did not regard the two microstates as distinct. 








1.6 The “correct” enumeration of the microstates 21 


Now, the total number of permutations that can be effected among N particles, 
distributed according to the set {n z }, is 


where the n, must be consistent with the basic constraints (1.1.1) and (1.1.2). 19 If our parti¬ 
cles were distinguishable, then all these permutations would lead to “distinct” microstates. 
However, in view of the indistinguishability of the particles, these permutations must be 
regarded as leading to one and the same thing; consequently, for any distribution set {«;}, 
we have one, and only one, distinct microstate. As a result, the total number of distinct 
microstates accessible to the system, consistent with a given macrostate ( N, V, E), would 
be severely cut down. However, since factor (1) itself depends on the numbers n, consti¬ 
tuting a particular distribution set and for a given macrostate there will be many such sets, 
there is no straightforward way to “correct down” the number of microstates computed on 
the basis of the classical concept of “distinguishability” of the particles. 

The recipe of Gibbs clearly amounts to disregarding the details of the numbers n t and 
slashing the whole sequence of microstates by a common factor Nl; this is correct for situa¬ 
tions in which all N particles happen to be in different energy states but is certainly wrong 
for other situations. We must keep in mind that by adopting this recipe we are still using a 
spurious weight factor, 


for the distribution set {n z } whereas in principle we should use a factor of unity, irre¬ 
spective of the values of the numbers n z . 20 Nonetheless, the recipe of Gibbs does correct 
the situation in a gross manner, though in matters of detail it is still inadequate. In fact, 
it is only by taking w{nj} to be equal to unity (or zero) that we obtain true quantum 
statistics'. 

We thus see that the recipe of Gibbs corrects the enumeration of the microstates, as 
necessitated by the indistinguishability of the particles, only in a gross manner. Numeri¬ 
cally, this would approach closer and closer to reality as the probability of the n, being 
greater than 1 becomes less and less. This in turn happens when the given system is 
at a sufficiently high temperature (so that many more energy states become accessible) 
and has a sufficiently low density (so that there are not as many particles to accommo¬ 
date). It follows that the “corrected” classical statistics represents truth more closely if the 
expectation values of the occupation numbers nj are much less than unity: 


<n/>«1, (3) 

19 The presence of the factors («,-!) in the denominator is related to the comment made in the preceding note. 

20 Or a factor of zero if the distribution set j n,} is disallowed on certain physical grounds, such as the Pauli exclusion 
principle. 
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that is, if the numbers tij are generally 0, occasionally 1, and rarely greater than 1. Condition 
(3) in a way defines the classical limit. We must, however, remember that it is because of the 
application of the correction factor 1/iV!, which replaces (1) by (2), that our results agree 
with reality at least in the classical limit. 

In Section 5.5 we shall demonstrate, in an independent manner, that the factor by 
which the number of microstates, as computed for the “labeled” molecules, be reduced so 
that the formalism of classical statistical mechanics becomes a true limit of the formalism 
of quantum statistical mechanics is indeed Nl. 


Problems 

1.1. (a) Show that, for two large systems in thermal contact, the number tl (0) (£' (0) ,h'i) of Section 1.2 


can be expressed as a Gaussian in the variable E\. Determine the root-mean-square deviation 
of £i from the mean value E\ in terms of other quantities pertaining to the problem. 

(b) Make an explicit evaluation of the root-mean-square deviation of E\ in the special case when 
the systems Ai and A 2 are ideal classical gases. 

1.2. Assuming that the entropy S and the statistical number £2 of a physical system are related through 
an arbitrary functional form 


s=fm 


show that the additive character of S and the multiplicative character of £2 necessarily require that 
the function/(£2) be of the form (1.2.6). 

1.3. Two systems A and B, of identical composition, are brought together and allowed to exchange both 
energy and particles, keeping volumes V A and Vg constant. Show that the minimum value of the 
quantity (dE A /dN A ) is given by 


I^aTb ~ ti B T A 
Tb~T a 


where the /r’s and the T’s are the respective chemical potentials and temperatures. 

1.4. In a classical gas of hard spheres (of diameter D), the spatial distribution of the particles is no 
longer uncorrelated. Roughly speaking, the presence of n particles in the system leaves only 
a volume ( V - nu 0 ) available for the (n + l)th particle; clearly, v 0 would be proportional to 
D 3 . Assuming that Nvo <K V, determine the dependence of £2 (N, V, E) on V (compare to 
equation (1.4.1)) and show that, as a result of this, V in the ideal-gas law (1.4.3) gets replaced 
by ( V - b), where b is four times the actual volume occupied by the particles. 

1.5. Read Appendix A and establish formulae (1.4.15) and (1.4.16). Estimate the importance of the 
linear term in these formulae, relative to the main term (jr/6)e* 3 / 2 , for an oxygen molecule 
confined to a cube of side 10 cm; take e = 0.05 eV. 

1.6. A cylindrical vessel 1 m long and 0.1 m in diameter is filled with a monatomic gas at P = 1 atm and 
T = 300 K. The gas is heated by an electrical discharge, along the axis of the vessel, which releases 
an energy of 10 4 joules. What will the temperature of the gas be immediately after the discharge? 

1.7. Study the statistical mechanics of an extreme relativisitic gas characterized by the single-particle 
energy states 



instead of (1.4.5), along the lines followed in Section 1.4. Show that the ratio Cp/Cv in this case is 
4/3, instead of 5/3. 

1.8. Consider a system of quasiparticles whose energy eigenvalues are given by 


E(n) = nhv\ 71 = 0,1,2,.... 
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Obtain an asymptotic expression for the number Q of this system for a given number N of the 
quasiparticles and a given total energy E. Determine the temperature T of the system as a function 
of E/N and hv, and examine the situation for which E/(Nhv) 1. 

1.9. Making use of the fact that the entropy S(N, V,E) of a thermodynamic system is an extensive 
quantity, show that 



Note that this result implies that (—N/i + PV + E)/T = S, that is, N/i =E+ PV - TS. 

1.10. A mole of argon and a mole of helium are contained in vessels of equal volume. If argon is at 300 K, 
what should the temperature of helium be so that the two have the same entropy? 

1.11. Four moles of nitrogen and one mole of oxygen at P = 1 atm and T = 300K are mixed together to 
form air at the same pressure and temperature. Calculate the entropy of mixing per mole of the air 
formed. 

1.12. Show that the various expressions for the entropy of mixing, derived in Section 1.5, satisfy the 
following relations: 

(a) For all N^V^N-i, and V 2 , 

(AS)i= 2 = {(AS) — (AS)*} > 0, 

the equality holding when and only when N\/V\ = N 2 /V 2 . 

(b) For a given value of (N\ + N 2 ), 

(AS)* < (Ni + N 2 )kln2, 

the equality holding when and only when N\ =N 2 . 

1.13. If the two gases considered in the mixing process of Section 1.5 were initially at different 
temperatures, say T\ and T 2 , what would the entropy of mixing be in that case? Would the 
contribution arising from this cause depend on whether the two gases were different or identical? 

1.14. Show that for an ideal gas composed of monatomic molecules the entropy change, between any 
two temperatures, when the pressure is kept constant is 5/3 times the corresponding entropy 
change when the volume is kept constant. Verify this result numerically by calculating the actual 
values of (AS )p and (AS) v per mole of an ideal gas whose temperature is raised from 300 K to 400 K. 

1.15. We have seen that the CP, V) -relationship during a reversible adiabatic process in an ideal gas is 
governed by the exponent y, such that 


PV r = const. 


Consider a mixture of two ideal gases, with mole fractions and f 2 and respective exponents yi 
and y 2 . Show that the effective exponent y for the mixture is given by 


K — 1 yi - 1 Y2 1 


i h , h 


1.16. Establish thermodynamically the formulae 



and 



Express the pressure P of an ideal classical gas in terms of the variables \i and T, and verify the 
above formulae. 






Elements of Ensemble Theory 


In the preceding chapter we noted that, for a given macrostate (N,V,E), a statistical 
system, at any time t, is equally likely to be in any one of an extremely large number of 
distinct microstates. As time passes, the system continually switches from one microstate 
to another, with the result that, over a reasonable span of time, all one observes is a behav¬ 
ior “averaged” over the variety of microstates through which the system passes.lt may, 
therefore, make sense if we consider, at a single instant of time, a rather large number of 
systems — all being some sort of “mental copies” of the given system — which are charac¬ 
terized by the same macrostate as the original system but are, naturally enough, in all sorts 
of possible microstates. Then, under ordinary circumstances, we may expect that the aver¬ 
age behavior of any system in this collection, which we call an ensemble, would be identical 
to the time-averaged behavior of the given system. It is on the basis of this expectation that 
we proceed to develop the so-called ensemble theory. 

For classical systems, the most appropriate framework for developing the desired for¬ 
malism is provided by the phase space. Accordingly, we begin our study of the various 
ensembles with an analysis of the basic features of this space. 


2.1 Phase space of a classical system 

The microstate of a given classical system, at any time t, may be defined by specifying the 
instantaneous positions and momenta of all the particles constituting the system. Thus, 
if N is the number of particles in the system, the definition of a microstate requires the 
specification of 3 N position coordinates < 71 , <72 7 - - -»<73iv and 3JV momentum coordinates 
pi,p 2 ,---,pm- Geometrically, the set of coordinates (qi,pp, where i— 1,2,...,31V, maybe 
regarded as a point in a space of 6 N dimensions. We refer to this space as the phase space, 
and the phase point (q t , pp as a representative point, of the given system. 

Of course, the coordinates q, and p, are functions of the time t; the precise manner in 
which they vary with t is determined by the canonical equations of motion, 


m = 


Pi = ~ 


dHjqupi) 

dPi 

dH(qi,pi ) 


3 'hi 


i= 1,2, ...,31V, 


( 1 ) 


where H(qi,pp is the Hamiltonian of the system. Now, as time passes, the set of 
coordinates (q,-, pp, which also defines the microstate of the system, undergoes a continual 
change. Correspondingly, our representative point in the phase space carves out a 
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trajectory whose direction, at any time t, is determined by the velocity vector v = (), 
which in turn is given by the equations of motion (1). It is not difficult to see that the 
trajectory of the representative point must remain within a limited region of the phase 
space; this is so because a finite volume V directly limits the values of the coordinates q u 
while a finite energy £ limits the values of both the q t and the p, [through the Hamiltonian 
H(qi, pi)]. In particular, if the total energy of the system is known to have a precise value, 
say E, the corresponding trajectory will be restricted to the “hypersurface” 


H(qrPi) =E (2) 

of the phase space; on the other hand, if the total energy may lie anywhere in the range 
(E — | A,E+ 2 A), the corresponding trajectory will be restricted to the “hypershell” 
defined by these limits. 

Now, if we consider an ensemble of systems (i.e., the given system, along with a large 
number of mental copies of it) then, at any time t, the various members of the ensem¬ 
ble will be in all sorts of possible microstates; indeed, each one of these microstates must 
be consistent with the given macrostate that is supposed to be common to all members 
of the ensemble. In the phase space, the corresponding picture will consist of a swarm of 
representative points, one for each member of the ensemble, all lying within the “allowed” 
region of this space. As time passes, every member of the ensemble undergoes a continual 
change of microstates; correspondingly, the representative points constituting the swarm 
continually move along their respective trajectories. The overall picture of this movement 
possesses some important features that are best illustrated in terms of what we call a 
density function p(q,p; t). 1 This function is such that, at any time t, the number of repre¬ 
sentative points in the “volume element” ( d 3N qd 3N p) around the point ( q,p) of the phase 
space is given by the product p{q,p-,t)d 3N qd 3N p. Clearly, the density function p(q,p;t) 
symbolizes the manner in which the members of the ensemble are distributed over all 
possible microstates at different instants of time. Accordingly, the ensemble average (/) of a 
given physical quantity/!*?, p), which may be different for systems in different microstates, 
would be given by 


/ fid’ P)p( < b P'< t)d 3N qd 3N p 
f p(q,p-,t)d 3N qd 3N p 


The integrations in (3) extend over the whole of the phase space; however, it is only 
the populated regions of the phase space (p / 0) that really contribute. We note that, in 
general, the ensemble average (/> may itself be a function of time. 

An ensemble is said to be stationary if p does not depend explicitly on time, that is, at 
all times 



(4) 


Note that ( q,p) is an abbreviation of ( qi,pp = (q 1,.. .,<j3At,pi,...,p3jv). 
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Clearly, for such an ensemble the average value (/> of any physical quantity f(q, p) will 
be independent of time. Naturally, a stationary ensemble qualifies to represent a system in 
equilibrium. To determine the circumstances under which equation (4) may hold, we have 
to make a rather detailed study of the movement of the representative points in the phase 
space. 


2.2 Liouville's theorem and its consequences 


Consider an arbitrary “volume” to in the relevant region of the phase space and let the 
“surface” enclosing this volume be denoted by a; see Figure 2.1. Then, the rate at which 
the number of representative points in this volume increases with time is written as 



( 1 ) 


where dco = ( d 3N qd 3N p ). On the other hand, the net rate at which the representative points 
“flow” out of to (across the bounding surface a) is given by 



( 2 ) 


a 


here, v is the velocity vector of the representative points in the region of the surface 
element da while h is the (outward) unit vector normal to this element. By the divergence 
theorem, (2) can be written as 


J div(pv)d<w, 


(3) 


of course, the operation of divergence here means 



(4) 



n 


v 


A 


FIGURE 2.1 The "hydrodynamics" of the representative points in the phase space. 
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In view of the fact that there are no “sources” or “sinks” in the phase space and hence the 
total number of representative points remains conserved, 2 we have, by (1) and (3), 


3 

at 



J div(pv)dco, 

CO 


(5) 


that is, 


/! 


dp 

dt 


+ div(pi>) \da> = 0. 


( 6 ) 


Now, the necessary and sufficient condition that integral (6) vanish for all arbitrary 
volumes to is that the integrand itself vanish everywhere in the relevant region of the phase 
space. Thus, we must have 


— +div(pi/) = 0, 

Of 

which is the equation of continuity for the swarm of the representative points. 
Combining (4) and (7), we obtain 


dp 

dt 




a 4 / 
dqi 


+ 



= 0. 


(7) 


( 8 ) 


The last group of terms vanishes identically because, by the equations of motion, we have, 
for all i, 


deg _ d 2 H(q i} pi) = 3 2 H(q i ,p i ) dpi 

dqi dqidpi ~ dpidqi dpi 


Further, since p = p(q,p\t), the remaining terms in (8) may be combined to form the 
“total” time derivative of p, with the result that 


dp 

dt 


dp 

dt 


+ [p,H] = 0. 


( 10) 3 


Equation (10) embodies Liouville’s theorem (1838). According to this theorem, the “local” 
density of the representative points, as viewed by an observer moving with a representa¬ 
tive point, stays constant in time. Thus, the swarm of the representative points moves in 


2 This means that in the ensemble under consideration neither are any new members being added nor are any old 
ones being removed. 

3 We recall that the Poisson bracket [p,H] stands for the sum 



dp d H 
dq t d pi 


dp dH \ 
dpi dq t ) 


which is identical to the group of terms in the middle of ( 8 ). 
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the phase space in essentially the same manner as an incompressible fluid moves in the 
physical space! 

A distinction must be made, however, between equation (10) on one hand and 
equation (2.1.4) on the other. While the former derives from the basic mechanics of the 
particles and is therefore quite generally true, the latter is only a requirement for equi¬ 
librium which, in a given case, may or may not be satisfied. The condition that ensures 
simultaneous validity of the two equations is clearly 



( 11 ) 


Now, one possible way of satisfying (11) is to assume that p, which is already assumed 
to have no explicit dependence on time, is independent of the coordinates ( q,p ) as well, 
that is, 


p(q,p) = const. (12) 

over the relevant region of the phase space (and, of course, is zero everywhere else). Physi¬ 
cally, this choice corresponds to an ensemble of systems that at all times are uniformly 
distributed over all possible microstates. The ensemble average (2.1.3) then reduces to 

(/) = - f f(q,p)dco; (13) 

o> J 

CO 

here, u> denotes the total “volume” of the relevant region of the phase space. Clearly, in 
this case, any member of the ensemble is equally likely to be in any one of the various 
possible microstates, inasmuch as any representative point in the swarm is equally likely 
to be in the neighborhood of any phase point in the allowed region of the phase space. 
This statement is usually referred to as the postulate of “equal a priori probabilities” for 
the various possible microstates (or for the various volume elements in the allowed region 
of the phase space); the resulting ensemble is referred to as the microcanonical ensemble. 

A more general way of satisfying (11) is to assume that the dependence of p on ( q,p ) 
comes only through an explicit dependence on the Hamiltonian H(q, p), that is, 

p(q,p) = p[H(q,p)]; (14) 

condition (11) is then identically satisfied. Equation (14) provides a class of density func¬ 
tions for which the corresponding ensemble is stationary. In Chapter 3 we shall see that 
the most natural choice in this class of ensembles is the one for which 

p(q,p) oc.exp[-H(q,p)/kT]. (15) 


The ensemble so defined is referred to as the canonical ensemble. 
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2.3 The microcanonical ensemble 

In this ensemble the macrostate of a system is defined by the number of molecules N, 
the volume V, and the energy E. However, in view of the considerations expressed in 
Section 1.4, we may prefer to specify a range of energy values, say from (E — jA) to 
(E+ 2 A), rather than a sharply defined value E. With the macrostate specified, a choice 
still remains for the systems of the ensemble to be in any one of a large number of pos¬ 
sible microstates. In the phase space, correspondingly, the representative points of the 
ensemble have a choice to lie anywhere within a “hypershell” defined by the condition 

(e -^ <H(q,p) < (e+ (1) 

The volume of the phase space enclosed within this shell is given by 


a> = J dm = J (d 3N qd 3N p^, (2) 

where the primed integration extends only over that part of the phase space which con¬ 
forms to condition (1). It is clear that to will be a function of the parameters N,V,E, 
and A. 

Now, the microcanonical ensemble is a collection of systems for which the density 
function p is, at all times, given by 


p(q,p) = const, if (E - J;A) < H{q,p) < ^E + 
0 otherwise 


(3) 


Accordingly, the expectation value of the number of representative points lying in a vol¬ 
ume element dto of the relevant hypershell is simply proportional to dm. In other words, the 
a priori probability of finding a representative point in a given volume element dm is the 
same as that of finding a representative point in an equivalent volume element dm located 
anywhere in the hypershell. In our original parlance, this means an equal a priori probabil¬ 
ity for a given member of the ensemble to be in any one of the various possible microstates. 
In view of these considerations, the ensemble average {/), as given by equation (2.2.13), 
acquires a simple physical meaning. To see this, we proceed as follows. 

Since the ensemble under study is a stationary one, the ensemble average of any phy¬ 
sical quantity/ will be independent of time; accordingly, taking a time average thereof will 
not produce any new result. Thus 


{/) = the ensemble average of/ 


= the time average of (the ensemble average of/). 
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Now, the processes of time averaging and ensemble averaging are completely indepen¬ 
dent, so the order in which they are performed may be reversed without causing any 
change in the value of (/). Thus 

(/) = the ensemble average of (the time average of/). 

Now, the time average of any physical quantity, taken over a sufficiently long interval of 
time, must be the same for every member of the ensemble, for after all we are dealing 
with only mental copies of a given system. 4 Therefore, taking an ensemble average thereof 
should be inconsequential, and we may write 

(/) = the long-time average of/, 

where the latter may be taken over any member of the ensemble. Furthermore, the long¬ 
time average of a physical quantity is all one obtains by making a measurement of that 
quantity on the given system; therefore, it may be identified with the value one expects to 
obtain through experiment. Thus, we finally have 


(/) —/exp- (4) 

This brings us to the most important result: the ensemble average of any physical quantity 
f is identical to the value one expects to obtain on making an appropriate measurement on 
the given system. 

The next thing we look for is the establishment of a connection between the mechanics 
of the microcanonical ensemble and the thermodynamics of the member systems. To do 
this, we observe that there exists a direct correspondence between the various microstates 
of the given system and the various locations in the phase space. The volume co (of the 
allowed region of the phase space) is, therefore, a direct measure of the multiplicity T of the 
microstates accessible to the system. To establish a numerical correspondence between co 

4 To provide a rigorous justification for this assertion is not trivial. One can readily see that if, for any particular mem¬ 
ber of the ensemble, the quantity / is averaged only over a short span of time, the result is bound to depend on the 
relevant “subset of microstates'’ through which the system passes during that time. In the phase space, this will mean 
an averaging over only a “part of the allowed region.” However, if we employ instead a sufficiently long interval of time, 
the system may be expected to pass through almost all possible microstates “without fear or favor”; consequently, the 
result of the averaging process would depend only on the macrostate of the system, and not on a subset of microstates. 
Correspondingly, the averaging in the phase space would go over practically all parts of the allowed region, again “with¬ 
out fear or favor.” In other words, the representative point of our system will have traversed each and every part of the 
allowed region almost uniformly. This statement embodies the so-called ergodic theorem or ergodic hypothesis, which 
was first introduced by Boltzmann (1871). According to this hypothesis, the trajectory of a representative point passes, 
in the course of time, through each and every point of the relevant region of the phase space. A little reflection, however, 
shows that the statement as such requires a qualification; we better replace it by the so-called quasiergodic hypothesis, 
according to which the trajectory of a representative point traverses, in the course of time, any neighborhood of any point 
of the relevant region. For further details, see ter Haar (1954, 1955), Farquhar (1964). 

Now, when we consider an ensemble of systems, the foregoing statement should hold for every member of the 
ensemble; thus, irrespective of the initial (and final) states of the various systems, the long-time average of any physical 
quantity/ should be the same for every member system. 
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and T, we need to discover a fundamental volume a> o that could be regarded as “equivalent 
to one microstate.” Once this is done, we may say that, asymptotically, 

f = (o/coo- (5) 

The thermodynamics of the system would then follow in the same way as in Sections 1.2- 
1.4, namely through the relationship 

S = 7clnr = fcln(fti/a>o), etc. (6) 

The basic problem then consists in determining w o. From dimensional considerations, 
see (2), coo must be in the nature of an “angular momentum raised to the power 31V.” To 
determine it exactly, we consider certain simplified systems, both from the point of view 
of the phase space and from the point of view of the distribution of quantum states. 


2.4 Examples 

We consider, first of all, the problem of a classical ideal gas composed of monatomic par¬ 
ticles; see Section 1.4. In the microcanonical ensemble, the volume co of the phase space 
accessible to the representative points of the (member) systems is given by 

/ / 

co = f ...J (d 3N qd 3N p), (1) 

where the integrations are restricted by the conditions that (i) the particles of the system 
are confined in physical space to volume V, and (ii) the total energy of the system lies 
between the limits (E - \ A) and (E + \ A). Since the Hamiltonian in this case is a function 
of the pi alone, integrations over the q, can be carried out straightforwardly; these give a 
factor of V N . The remaining integral is 

/ / /■••/ ^ 

/ \ 3iV / \/i\ / i \ 3iV / ^ 

yE— ypj/2 m )<[E+2^) 2ra^E—^ Aj< y?<2ra(£+2 Aj 

which is equal to the volume of a 31V-dimensional hypershell, bounded by hyperspheres 
of radii 






For A <$; E, this is given by the thickness of the shell, which is almost equal to A(m/2£) 1 / 2 , 
multiplied by the surface area of a 31V-dimensional hypersphere of radius ^f(2mE). By 
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equation (7) of Appendix C, we obtain for this integral 


A 



2 n 3N ' 2 
[(3JV/2) — 1]! 


(2 mE) (3N ~ 1)/2 


which gives 

A ^ N {2n mE) 3N / 2 
W ~E [(3JV/2J — 1]!' 


( 2 ) 


Comparing (2) with (1.4.17 and 1.4.17a), we obtain the desired correspondence, namely 


(a>/ r )asymp = 0)Q — . 


see also Problem 2.9. Quite generally, if the system under study has JV degrees of freedom, 
the desired conversion factor is 


co 0 = h’". 


(3) 


In the case of a single particle, eV = 3; accordingly, the number of microstates available 
would asymptotically be equal to the volume of the allowed region of the phase space 
divided by h 3 . Let Y,(P) denote the number of microstates available to a free particle con¬ 
fined to volume V of the physical space, its momentum p being less than or equal to a 
specified value P. Then 

(*<**)-jfT* <4) 

p<p 


from which we obtain for the number of micro states with momentum lying between p and 
p + dp 

, dY,(p ) V 9 , 

g(p)dp = — dp ~ j^lnp^dp. (5) 

Expressed in terms of the particle energy, these expressions assume the form 

P 47r q /o 

S(£)Ri FT (2m£) 7 (6) 


and 

a(s)ds = d^^-de ~ -^27r(2m) 3 ' /2 £ 1 ^ 2 de. (7) 

fle n - 5 

The next case we consider here is that of a one-dimensional simple harmonic oscillator. 
The classical expression for the Hamiltonian of this system is 

H( q , P )= l -W + - Lp 2 , 


( 8 ) 
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where k is the spring constant and m the mass of the oscillating particle. The space 
coordinate q and the momentum coordinate p of the system are given by 

q = Acos(a>t + <p), p = mq = — mcoAs\n(a>t + cj>), (9) 

A being the amplitude and co the (angular) frequency of vibration: 


co =,/( k/m ). 

The energy of the oscillator is a constant of the motion, and is given by 

E = -mco 2 A 2 . 

2 


( 10 ) 


(ID 


The phase-space trajectory of the representative point ( q,p ) of this system is determined 
by eliminating t between expressions (9) for q(t) and p(t); we obtain 


(2 E/mco 2 ) (2 mE) 


= 1, 


( 12 ) 


which is an ellipse, with axes proportional to JE and hence area proportional to E; to be 
precise, the area of this ellipse is 2ttE/co. Now, if we restrict the oscillator energy to the 
interval (E- \ A,E+ \ A), its representative point in the phase space will be confined to 
the region bounded by elliptical trajectories corresponding to the energy values (E + \ A) 
and (E - \ A). The “volume” (in this case, the area) of this region will be 


/ / 


2tt 


(i dqdp ) = 


( £ +Z A ) 2n ( E ~l A ) 2nA 


(13) 


(e- 1a ) <ff(g,p) < (e+ >a) 


According to quantum mechanics, the energy eigenvalues of the harmonic oscillator are 
given by 


En — 



n = 0,1,2,... 


(14) 


In terms of phase space, one could say that the representative point of the system must 
move along one of the “chosen” trajectories, as shown in Figure 2.2; the area of the phase 
space between two consecutive trajectories, for which A = fuo, is simply 27r/i. 5 For arbitrary 
values of E and A, such that E » A » hco, the number of eigenstates within the allowed 

5 Strictly speaking, the very concept of phase space is invalid in quantum mechanics because there, in principle, it is 
wrong to assign to a particle the coordinates q and p simultaneously. Nevertheless, the ideas discussed here are tenable 
in the correspondence limit. 
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P 



FIGURE 2.2 Eigenstates of a linear harmonic oscillator, in relation to its phase space. 


energy interval is very nearly equal to A /hco. Hence, the area of the phase space equivalent 
to one eigenstate is, asymptotically, given by 

coq = (2jt A/a>)/(A/ha>) = 2nh = h. (15) 

If, on the other hand, we consider a system of N harmonic oscillators along the same lines 
as above, we arrive at the result: coo = h N (see Problem 2.7). Thus, our findings in these 
cases are consistent with our earlier result (3). 

2.5 Quantum states and the phase space 

At this stage we would like to say a few words on the central role played here by the Planck 
constant h. The best way to appreciate this role is to recall the implications of the Heisen¬ 
berg uncertainty principle, according to which we cannot specify simultaneously both the 
position and the momentum of a particle exactly. An element of uncertainty is inherently 
present and can be expressed as follows: assuming that all conceivable uncertainties of 
measurement are eliminated, even then, by the very nature of things, the product of the 
uncertainties A q and A p in the simultaneous measurement of the canonically conjugate 
coordinates q and p would be of order h: 

(AqAp) m [ n ~ h. (1) 

Thus, it is impossible to define the position of a representative point in the phase space of 
the given system more accurately than is allowed by condition (1). In other words, around 
any point ( q , p) in the (two-dimensional) phase space, there exists an area of order h within 
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which the position of the representative point cannot be pinpointed. In a phase space of 
2 AT dimensions, the corresponding “volume of uncertainty” around any point would be 
of order . Therefore, it seems reasonable to regard the phase space as made up of ele¬ 
mentary cells, of volume ~ h' N , and to consider the various positions within such a cell 
as nondistinct. These cells could then be put into one-to-one correspondence with the 
quantum-mechanical states of the system. 

It is, however, obvious that considerations of uncertainty alone cannot give us the 
exact value of the conversion factor coo■ This could only be done by an actual counting 
of microstates on one hand and a computation of volume of the relevant region of the 
phase space on the other, as was done in the examples of the previous section. Clearly, a 
procedure along these lines could not be possible until after the work of Schrodinger and 
others. Elistorically, however, the first to establish the result (2.4.3) was Tetrode (1912) who, 
in his well-known work on the chemical constant and the entropy of a monatomic gas, 
assumed that 


m = ( zh) Jf , ( 2 ) 

where z was supposed to be an unknown numerical factor. Comparing theoretical results 
with the experimental data on mercury, Tetrode found that z was very nearly equal to unity; 
from this he concluded that “it seems rather plausible that z is exactly equal to unity, as has 
already been taken by O. Sackur (1911).” 6 

In the extreme relativistic limit, the same result was established by Bose (1924). In his 
famous treatment of the photon gas, Bose made use of Einstein’s relationship between the 
momentum of a photon and the frequency of the associated radiation, namely 


P = 


hv 
c ’ 


(3) 


and observed that, for a photon confined to a three-dimensional cavity of volume V, the 
relevant “volume” of the phase space, 

/ 

I {d 3 qd 3 p) — V4np 2 dp = V(4nh 3 v 2 /c 3 )dv, (4) 


would correspond exactly to the Rayleigh expression, 


V(4j rv 2 /c 3 )dv, 


(5) 


for the number of normal modes of a radiation oscillator, provided that we divide phase 
space into elementary cells of volume h 3 and put these cells into one-to-one corre¬ 
spondence with the vibrational modes of Rayleigh. It may, however, be added that a 
two-fold multiplicity of these states (g = 2) arises from the spin orientations of the photon 


6 For a more satisfactory proof of this result, see Section 5.5, especially equation (5.5.22). 
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(or from the states of polarization of the vibrational modes); this requires a multiplica¬ 
tion of both expressions (4) and (5) by a factor of 2, leaving the conversion factor h 3 
unchanged. 


Problems 


2.1. Show that the volume element 


3N 

dw — | | (dqj dpi) 
i=i 


of the phase space remains invariant under a canonical transformation of the (generalized) 
coordinates ( q,p ) to any other set of (generalized) coordinates (Q,P). 

[Hint-. Before considering the most general transformation of this kind, which is referred to as a 
contact transformation, it may be helpful to consider a point transformation — one in which the 
new coordinates Q, and the old coordinates cp transform only among themselves.] 

2.2. (a) Verify explicitly the invariance of the volume element do, of the phase space of a single particle 

under transformation from the Cartesian coordinates ( x,y , z, p x , p y , Pz ) to the spherical polar 
coordinates [r,9,cp,p r ,pg,p^). 

(b) The foregoing result seems to contradict the intuitive notion of “equal weights for equal solid 
angles,” because the factor sin0 is invisible in the expression for do). Show that if we average 
out any physical quantity, whose dependence on pg and p,p comes only through the kinetic 
energy of the particle, then as a result of integration over these variables we do indeed recover 
the factor sin0 to appear with the subelement ( d9 d(p). 

2.3. Starting with the line of zero energy and working in the (two-dimensional) phase space of a classical 
rotator, draw lines of constant energy that divide phase space into cells of “volume” h. Calculate the 
energies of these states and compare them with the energy eigenvalues of the corresponding 
quantum-mechanical rotator. 

2.4. By evaluating the “volume” of the relevant region of its phase space, show that the number of 
microstates available to a rigid rotator with angular momentum < M is (M/ft) 2 . Hence determine 
the number of microstates that may be associated with the quantized angular momentum 

Mj = + l)}ft, where j = 0,1,2,... or |, |,.... Interpret the result physically. 

[Hint-. It simplifies to consider motion in the variables 9 and </>, with M 2 =p 2 + (p</,/ sinf?) 2 .] 

2.5. Consider a particle of energy E moving in a one-dimensional potential well V ( q ), such that 


mh 


dV 

dq 


<^{m{E-V)} 3 ' 2 . 


Show that the allowed values of the momentum p of the particle are such that 



where n is an integer. 

2.6. The generalized coordinates of a simple pendulum are the angular displacement 9 and the angular 
momentum ml 2 9. Study, both mathematically and graphically, the nature of the corresponding 
trajectories in the phase space of the system, and show that the area A enclosed by a trajectory is 
equal to the product of the total energy E and the time period r of the pendulum. 

2.7. Derive (i) an asymptotic expression for the number of ways in which a given energy E can be 
distributed among a set of N one-dimensional harmonic oscillators, the energy eigenvalues of the 

oscillators being (n + \^ hw, n = 0,1,2,..., and (ii) the corresponding expression for the “volume” of 
the relevant region of the phase space of this system. Establish the correspondence between the 
two results, showing that the conversion factor wo is precisely h N . 
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2.8. Following the method of Appendix C, replacing equation (C.4) by the integral 

OO 

/ c " v *- 2 - 

0 


show that 


V 3N = j ...j PJ (^nrfdrij = (8nR 3 ) N /(3N)\. 

J N J i=1 

0<T.n<R 


Using this result, compute the “volume” of the relevant region of the phase space of an extreme 
relativistic gas (e = pc) of N particles moving in three dimensions. Hence, derive expressions for 
the various thermodynamic properties of this system and compare your results with those of 
Problem 1.7. 

2.9. (a) Solve the integral 



(dx i... dx 3jv) 


3N 

o<£ bil<R 
1=1 


and use it to determine the “volume” of the relevant region of the phase space of an extreme 
relativistic gas (e = pc) of 3N particles moving in one dimension. Determine, as well, the 
number of ways of distributing a given energy E among this system of particles and show that, 
asymptotically, a> 0 = h 3N . 

(b) Compare the thermodynamics of this system with that of the system considered in Problem 2.8. 




The Canonical Ensemble 


In the preceding chapter we established the basis of ensemble theory and made 
a somewhat detailed study of the microcanonical ensemble. In that ensemble the 
macrostate of the systems was defined through a fixed number of particles N, a fixed vol¬ 
ume V, and a fixed energy £ [or, preferably, a fixed energy range (£ — \ A,E+ \ A)]. The 
basic problem then consisted in determining the number Q(N, V,E), or F(N, V,E\ A), of 
distinct microstates accessible to the system. From the asymptotic expressions of these 
numbers, complete thermodynamics of the system could be derived in a straightforward 
manner. However, for most physical systems, the mathematical problem of determin¬ 
ing these numbers is quite formidable. For this reason alone, a search for an alternative 
approach within the framework of the ensemble theory seems necessary. 

Practically, too, the concept of a fixed energy (or even an energy range) for a system 
belonging to the real world does not appear satisfactory. For one thing, the total energy 
£ of a system is hardly ever measured; for another, it is hardly possible to keep its value 
under strict physical control. A far better alternative appears to be to speak of a fixed tem¬ 
perature T of the system — a parameter that is not only directly observable (by placing a 
“thermometer” in contact with the system) but also controllable (by keeping the system 
in contact with an appropriate “heat reservoir”). For most purposes, the precise nature of 
the reservoir is not very relevant; all one needs is that it should have an infinitely large 
heat capacity, so that, irrespective of energy exchange between the system and the reser¬ 
voir, an overall constant temperature can be maintained. Now, if the reservoir consists of 
an infinitely large number of mental copies of the given system we have once again an 
ensemble of systems — this time, however, it is an ensemble in which the macrostate of 
the systems is defined through the parameters N, V, and T. Such an ensemble is referred 
to as a canonical ensemble. 

In the canonical ensemble, the energy £ of a system is variable; in principle, it can 
take values anywhere between zero and infinity. The question then arises: what is the 
probability that, at any time t, a system in the ensemble is found to be in one of the states 
characterized by the energy value £ r ? 1 We denote this probability by the symbol P r . Clearly, 
there are two ways in which the dependence of P r on £, can be determined. One consists 
of regarding the system as in equilibrium with a heat reservoir at a common temperature T 
and studying the statistics of the energy exchange between the two. The other consists of 
regarding the system as a member of a canonical ensemble ( N , V, T), in which an energy 
8 is being shared by W identical systems constituting the ensemble, and studying the 

hn what follows, the energy levels E r appear as purely mechanical quantities — independent of the temperature of 
the system. For a treatment involving “temperature-dependent energy levels/' see Elcock and Landsberg (1957). 
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statistics of this sharing process. We expect that in the thermodynamic limit the final result 
in either case would be the same. Once P r is determined, the rest follows without difficulty. 


3.1 Equilibrium between a system and a heat 
reservoir 

We consider the given system A, immersed in a very large heat reservoir A'; see Figure 3.1. 
On attaining a state of mutual equilibrium, the system and the reservoir would have a 
common temperature, say T. Their energies, however, would be variable and, in principle, 
could have, at any time t, values lying anywhere between 0 and E (i}> , where E a)) denotes 
the energy of the composite system A (0) (= A+A'). If, at any particular instant of time, the 
system A happens to be in a state characterized by the energy value E r , then the reservoir 
would have an energy E' r , such that 

E r + E' r =E {0) = const. (1) 


Of course, since the reservoir is supposed to be much larger than the given system, any 
practical value of E r would be a very small fraction of E ,0) ; therefore, for all practical 
purposes, 


Er_ 
£( 0) 



( 2 ) 


With the state of the system A having been specified, the reservoir A' can still be in any 
one of a large number of states compatible with the energy value E' r . Let the number of 
these states be denoted by £2'(E' r ). The prime on the symbol O emphasizes the fact that 
its functional form will depend on the nature of the reservoir; of course, the details of 
this dependence are not going to be of any particular relevance to our final results. Now, 
the larger the number of states available to the reservoir, the larger the probability of the 
reservoir assuming that particular energy value EI r (and, hence, of the system A assum¬ 
ing the corresponding energy value E r ). Moreover, since the various possible states (with 
a given energy value) are equally likely to occur, the relevant probability would be directly 
proportional to this number; thus, 


P r OC Q\E' r ) = Q.\E m - E r ). 


(3) 



FIGURE 3.1 A given system A immersed in a heat reservoir A'; in equilibrium, the two have a common 
temperature T. 
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In view of (2), we may carry out an expansion of (3) around the value E' r — E a]) , that is, 
around E r = 0. However, for reasons of convergence, it is essential to effect the expansion 
of its logarithm instead: 

Inn \E' r ) = In <2’(E m ) + ( (. E’ r - E m ) + ■ ■ ■ 

V /B'=m 

~ const - p'E r , (4) 


where use has been made of formula (1.2.3), whereby 


9ln£2\ 

3-E ) N,V 


(5) 


note that, in equilibrium, /}' = /? = 1/kT. From (3) and (4), we obtain the desired result: 


P r oc exp(—/3E r ). 


( 6 ) 


Normalizing (6), we get 


exp (~pE r ) 

'' £exp (~fiE r y 

r 

where the summation in the denominator goes over all states accessible to the system A. 
We note that our final result (7) bears no relation whatsoever to the physical nature of the 
reservoir A'. 

We now examine the same problem from the ensemble point of view. 


3.2 A system in the canonical ensemble 

We consider an ensemble of W identical systems (which may be labelled as 1,2,..., W), 
sharing a total energy 8] let E r (r = 0,1,2,...) denote the energy eigenvalues of the systems. 
If n r denotes the number of systems which, at any time t, have the energy value E r , then 
the set of numbers {n r } must satisfy the obvious conditions 

Yn r = <V 

" Q) 

J2nrE r = 8 = JVU, 

r 

where U{— 8 /W) denotes the average energy per system in the ensemble. Any set {n r } 
that satisfies the restrictive conditions (1) represents a possible mode of distribution of the 
total energy 8 among the W members of the ensemble. Furthermore, any such mode can 
be realized in a number of ways, for we may effect a reshuffle among those members of 
the ensemble for which the energy values are different and thereby obtain a state of the 
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ensemble that is distinct from the original one. Denoting the number of different ways of 
doing so by the symbol W{n r ), we have 


W{n r } 


W! 

n 0 \ni\n 2 l...' 


( 2 ) 


In view of the fact that all possible states of the ensemble, which are compatible with con¬ 
ditions (1), are equally likely to occur, the frequency with which the distribution set {n r } 
may appear will be directly proportional to the number W{n r }. Accordingly, the “most 
probable” mode of distribution will be the one for which the number VC is a maximum. 
We denote the corresponding distribution set by {n*}; clearly, the set {n*} must also satisfy 
conditions (1). As will be seen in the sequel, the probability of appearance of other modes 
of distribution, however little they may differ from the most probable mode, is extremely 
low! Therefore, for all practical purposes, the most probable distribution set {n*} is the only 
one we have to contend with. 

However, unless this has been mathematically demonstrated, one must take into 
account all possible modes of distribution, as characterized by the various distribution 
sets {n r }, along with their respective weight factors W\n r \. Accordingly, the expectation 
values, or mean values, ( n r ) of the numbers n r would be given by 


{nr) = 


T! n r W{n r ] 

{Hr) 


T,'W{ n r ) ’ 
{n r I 


(3) 


where the primed summations go over all distribution sets that conform to conditions (1). 
In principle, the mean value (n r ), as a fraction of the total number W, should be a natural 
analog of the probability P r evaluated in the preceding section. In practice, however, the 
fraction n*/<N also turns out to be the same. 

We now proceed to derive expressions for the numbers n* and (n r ), and to show that, 
in the limit W -* oo, they are identical. 


The method of most probable values 

Our aim here is to determine that distribution set which, while satisfying conditions (1), 
maximizes the weight factor (2). For simplicity, we work with In W instead: 

InW = ln(e/V!) — yin(?z r !)- (4) 

r 

Since, in the end, we propose to resort to the limit W -»• oo, the values of n, (which are 
going to be of any practical significance) would also, in that limit, tend to infinity. It is, 
therefore, justified to apply the Stirling formula, ln(n!) n\nn — n, to (4) and write 


lnW= WlnW — Y_ w, In n r . 


(5) 
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If we shift from the set {n r } to a slightly different set { n r + Sn r }, then expression (5) would 
change by an amount 


<5(lnh0 = — ^(lnn r + l)8n r . (6) 

r 

Now, if the set {n r } is maximal, the variation <5(1 n W) should vanish. At the same time, 
in view of the restrictive conditions (1), the variations Sn r themselves must satisfy the 
conditions 


J2 Sn r = 0 

^ „ I (7) 

2_^E r &n r = 0 . 

r 

The desired set \n*\ is then determined by the method of Lagrange multipliers, 2 by which 
the condition determining this set becomes 

^{—(Inn* + 1) — a — ftE r }8n r = 0, (8) 

r 

where a and p are the Lagrangian undetermined multipliers that take care of the restrictive 
conditions (7) . In (8) , the variations S n r become completely arbitrary; accordingly, the only 
way to satisfy this condition is that all its coefficients must vanish identically, that is, for 
all r, 


Inn* = — (a + 1) — pE r , 


which gives 


n* = Cexp(— ftE r ), 


where C is again an undetermined parameter. 

To determine C and ft, we subject (9) to conditions (1), with the result that 

n£ _ exp (~pE r ) 

AT £exp {-pE r )’ 

r 


the parameter p being a solution of the equation 


Y,E r exp(-pE r ) 

— = U= -. 

t/V £exp( -pEr) 


(9) 


( 10 ) 


(11) 


2 For the method of Lagrange multipliers, see ter Haar and Wergeland (1966, Appendix C. 1). 
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Combining statistical considerations with thermodynamic ones, see Section 3.3, we can 
show that the parameter p here is exactly the same as the one appearing in Section 3.1, 
that is, p — 1/kT. 


The method of mean values 

Here we attempt to evaluate expression (3) for (n r ), taking into account the weight factors 
(2) and the restrictive conditions (1). To do this, we replace (2) by 


W[n r } 


... Uq Yl\ n,2 

J\F \COq CQ^ &>2 ... 
n 0 \ni\n 2 \... 


( 12 ) 


with the understanding that in the end all o> r will be set equal to unity, and introduce a 
function 


r ('V,U) = J^W{n r }, (13) 

{«r) 

where the primed summation, as before, goes over all distribution sets that conform to 
conditions (1). Expression (3) can then be written as 

(n r )=co r J- (lnr) . (14) 

da, r all <o r =\ 

Thus, all we need to know here is the dependence of the quantity In T on the parameters 
a) r . Now, 


E t I COr\ CO -I 

-V^7 

> hq\ n\\ 


{n r 


n 0 


»l 



(15) 


but the summation appearing here cannot be evaluated explicitly because it is restricted 
to those sets only that conform to the pair of conditions (1). If our distribution sets 
were restricted by the condition J2 r n r — ^ alone, then the evaluation of (15) would have 

been trivial; by the multinomial theorem, r(e/V) would have been simply (coq + coi H- . 

The added restriction J2 r n > ^ r = J\TU, however, permits the inclusion of only a “limited” 
number of terms in the sum — and that constitutes the real difficulty of the problem. 
Nevertheless, we can still hope to make some progress because, from a physical point 
of view, we do not require anything more than an asymptotic result — one that holds in 
the limit e/V -*■ oo. The method commonly used for this purpose is the one developed by 
Darwin and Fowler (1922a,b, 1923), which itself makes use of the saddle-point method of 
integration or the so-called method of steepest descent. 

We construct a generating function G(JSf ,z) for the quantity r(eA/", U ): 


OO 

G(JV,z) = J2 U)Z^ U 
U =0 


(16) 
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which, in view of equation (15) and the second of the restrictive conditions (1), may be 
written as 


G(W,z) = J2 
u= o 

It is easy to see that the summation over doubly restricted sets {n r }, followed by a summa¬ 
tion over all possible values of U, is equivalent to a summation over singly restricted sets 
{«,.}, namely the ones that satisfy only one condition: J2 r n r = Expression (17) can be 
evaluated with the help of the multinomial theorem, with the result 

G(e/V ,Z) = (ct)oZ E ° +a>iZ El -\ -^ 

= [/(z)]^, say. (18) 


) V ) 


(17) 


Now, if we suppose that the E r (and hence the total energy value 8 — tfsTU) are all integers, 
then, by (16), the quantity T(W, U) is simply the coefficient ofz'^ in the expansion of the 
function G(W ,z) as a power series in z. It can, therefore, be evaluated by the method of 
residues in the complex z-plane. 

To make this plan work, we assume to have chosen, right at the outset, a unit of energy 
so small that, to any desired degree of accuracy, we can regard the energies E r (and the pre¬ 
scribed total energy W U) as integral multiples of this unit. In terms of this unit, any energy 
value we come across will be an integer. We further assume, without loss of generality, that 
the sequence £o,£i,... is a nondecreasing sequence, with no common divisor; 3 also, for the 
sake of simplicity, we assume that Eq — 0. 4 The solution now is 


r(w,t/) = 

Znl 


L f(z■)]* 

Z WU+1 


dz, 


(19) 


where the integration is carried along any closed contour around the origin; of course, we 
must stay within the circle of convergence of the function/(z), so that a need for analytic 
continuation does not arise. 

First of all, we examine the behavior of the integrand as we proceed from the origin 
along the real positive axis, remembering that all our co r are virtually equal to unity and 
that 0 = Eq < Ei < Ez ■ ■ •. We find that the factor [/(z)] " v starts from the value 1 at z = 0, 
increases monotonically and tends to infinity as z approaches the circle of convergence of 
/(z), wherever that maybe. The factor z _( ' /VC7+1) , on the other hand, starts from a positive, 
infinite value at z = 0 and decreases monotonically as z increases. Moreover, the relative 
rate of increase of the factor [/(z)] 1 ^ itself increases monotonically while the relative rate 

3 Actually, this is not a serious restriction at all, for a common divisor, if any, can be removed by selecting the unit of 
energy correspondingly larger. 

4 This too is not serious, for by doing so we are merely shifting the zero of the energy scale; the mean energy U then 
becomes U - Eq, but we can agree to call it U again. 
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of decrease of the factor z -('* u + v > decreases monotonically. Under these circumstances, 
the integrand must exhibit a minimum (and no other extremum) at some value of z, say 
xq, within the circle of convergence. And, in view of the largeness of the numbers JSf and 
tA f U, this minimum may indeed be very steep! 

Thus, at z — Xq the first derivative of the integrand must vanish, while the second 
derivative must be positive and, hopefully, very large. Accordingly, if we proceed through 
the point z = x o in a direction orthogonal to the real axis, the integrand must exhibit an 
equally steep maximum. 5 Thus, in the complex z-plane, as we move along the real axis 
our integrand shows a minimum at z = xo, whereas if we move along a path parallel to 
the imaginary axis but passing through the point z — xo, the integrand shows a maximum 
there. It is natural to call the point xo a saddle point ; see Figure 3.2. For the contour of 
integration we choose a circle, with center at z = 0 and radius equal to xo, hoping that on 
integration along this contour only the immediate neighborhood of the sharp maximum 
at the point xo will make the most dominant contribution to the value of the integral. 6 

To carry out the integration we first locate the point x 0 . For this we write our integrand 
as 


i m\" 

Z JVU+1 


= exp[t/V g(z)\, 


( 20 ) 


where 


g(z) = In/ (z) 



( 21 ) 


|exp{A'g(z)}| 



FIGURE 3.2 The saddle point. 

5 This can be seen by noting that (i) an analytic function must possess a unique derivative everywhere (so, in our case, 
it must be zero, irrespective of the direction in which we pass through the point Xo), and (ii) by the Cauchy-Riemann 
conditions of analyticity, the second derivative of the function with respect to y must be equal and opposite to the second 
derivative with respect to x. 

6 It is indeed true that, for large «Af, the contribution from the rest of the circle is negligible. The intuitive reason for this 
is that the terms (m r z Er ), which constitute the function/(z), “reinforce” one another only at the point z = x o; elsewhere, 
there is bound to be disagreement among their phases, so that at all other points along the circle, \f(z)\ <f(x o). Now, the 
factor that actually governs the relative contributions is [|/(z)|//(JCo)] ,Ar ; for W 3> 1, this will clearly be negligible. For a 
rigorous demonstration of this point, see Schrodinger (1960, pp. 31-33). 
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while 


f{z) = J2 w rZ Er - 


The number x 0 is then determined by the equation 

/ . fix o) t/sru + 1 

s CJCto) = T, r-77— = °- 

/(xo) W x 0 

which, in view of the fact that JSf U » 1, can be written as 


U^x o 


/'(x 0 ) 


/(*o) E«rX^ r 


We further have 


g"(x 0 ) = 


/"(x 0 ) 

/(Xo) 


[/'(x 0 )] 2 \ Wt/+1 

[/(Xo)] 2 / Wx 2 


/"(xo) t / 2 - U 

fix o) X 2 


( 22 ) 


( 23 ) 


( 24 ) 


( 25 ) 


It will be noted here that, in the limit JST ->• oo and g (= W U) oo, m/th U staying constant, 

the number xo and the quantity g"(Xo) become independent of W. 

Expanding g(z ) about the point z = xo, along the direction of integration, that is, along 
the line z = xq + iy, we have 


g(z) = g(x 0 ) - -g (x 0 )y + • • ■ 
accordingly, the integrand (20) might be approximated as 


[fix o)]^ 

~mm ex P 


-fg"ix 0 )y 2 


( 26 ) 


Equation (19) then gives 


r , 1 [fix o)l^ 

r(W,t/)~— I exp 


2 Jti r? u+1 


/ 


JV n 

-Y^ {x ° )y 


idy 


[fix o)]^ 1 

x^u +1 ' {27rWg"(x 0 )) 1 /2’ 


( 27 ) 


which gives 


Ztt lnT(W, [/) = (ln/(x 0 ) - C/lnxo) - lnx 0 


1 

2W 


ln(2jrWg"(x 0 )). 


( 28 ) 
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In the limit W -»• oo (with U staying constant), the last two terms in this expression tend 
to zero, with the result 

-^-lnF(eyV, U) = ln/(xo) - U\yixq. (29) 

Jv 

Substituting for/(xo) and introducing a new variable fi, defined by the relationship 


x 0 = exp(—,6), 


(30) 


we get 


In T (W, U) = In | (Or exp (~pE r ) | 


+ PU. 


(31) 


The expectation value of the number n r then follows from (14) and (31): 


{Hr) 

w r exp( -pE r ) 

Y,o)rE r exp(-pEr) } 

r | uK 

W 

Y,co r exp(—f)E r ) 

- r 

;>> r exp (~pE r ) ' J ' dair 


(32) 


The term inside the curly brackets vanishes identically because of (24) and (30) . It has been 
included here to emphasize the fact that, for a fixed value of U, the number /)(= — lnx 0 ) in 
fact depends on the choice of the cop, see (24). We will appreciate the importance of this 
fact when we evaluate the mean square fluctuation in the number rip in the calculation of 
the expectation value of n r , this does not really matter. We thus obtain 


jn r ) _ exp(-/3 E r ) 
W £ exp (-££,-)’ 


(33) 


which is identical to expression (10) for n*/W. The physical significance of the parameter 
P is also the same as in that expression, for it is determined by equation (24) , with all co r = 1, 
that is, by equation (11) which fits naturally with equation (33) because U is nothing but 
the ensemble average of the variable E r : 

U=Y J E r P < = -^Y J E r^- (34) 


Finally, we compute fluctuations in the values of the numbers n r . We have, first of all, 


(n 2 r ) = 


Y,'r%W{n r } 

(«r) 


Z'mnr} 

{n r } 



(35) 


see equations (12) to (14). It follows that 


((A nr) 2 ) = ({ n r - (n r )} 2 ) = (n 2 ) - ( n r ) 2 




all tor—l 


(36) 
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Substituting from (31) and making use of (32), we get 


<(A« r ) 2 ) 3 

-= CD, - 

W 3 (D r 


(Drexp(-pEr) 

exp(-pE r ) 


+ 


Y.M r E r exp(-fSE r ) ] ' 

T , r rl 3p 

’ — t— i -H - U / CO i- 

£> r exp(- J S£' r ) I dcDr 


- all (Or= 1 


(37) 


We note that the term in the curly brackets would not make any contribution because it 
is identically zero, whatever the choice of the co r . However, in the differentiation of the first 
term, we must not forget to take into account the implicit dependence of ft on the u> r , 
which arises from the fact that unless the co r are set equal to unity the relation determining 
ft does contain o> r ; see equations (24) and (30), whereby 


y'.wrEr exp(—ftE r ) 

r 

Y,u r exp(-pE r ) 


all ojf — 1 


A straightforward calculation gives 


/_30_\ E r -U (n r ) 

\ 9a> r / u a||ft)r=| {$)-U2 


(38) 


(39) 


We can now evaluate (37), with the result 

((A«r) 2 ) (n r ) f(n r )\ 2 ,(n r ) /TT r J dp \ 

-^A^= aT-IatJ + ^ {U - Er} {d^ r ) l 


all COr= 1 


(n r ) 

w 


1- 


(n r ) (n r ) (E r — Uf 


W W {(Er-U) 2 ) 

For the relative fluctuation in n r , we get 

(Er-U ) 2 


f A M 2 \ 

1 

1 ( 

\(nr)J 1 

- (n r ) 



((Er-U) 2 ) 


(40) 


(41) 


As jsf -> oo, ( n r ) also — > oo, with the result that the relative fluctuations in n, tend to zero; 
accordingly, the canonical distribution becomes infinitely sharp and with it the mean 
value, the most probable value — in fact, any values of n r that appear with nonvanish¬ 
ing probability — become essentially the same. And that is the reason why two wildly 
different methods of obtaining the canonical distribution followed in this section have led 
to identical results. 
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3.3 Physical significance of the various statistical 
quantities in the canonical ensemble 

We start with the canonical distribution 

p = ( n r) _ exp (~pE r ) 

W J2 ex P(~PEr) ’ 


where p is determined by the equation 


J2E r exp{-pE r ) 

r 

X]exp(-/3 E r ) 


-^ ln {E ex p ( -^)}- 


(1) 


( 2 ) 


We now look for a general recipe to extract information about the various macroscopic 
properties of the given system on the basis of the foregoing statistical results. For this, 
we recall certain thermodynamic relationships involving the Helmholtz free energy 
A(= U - TS ), namely 


and 


dA = dU — TdS — SdT = -SdT - PdV + /x dN, 

s=-(A) , P= W , 

\dT ) NV \dV J N,T \91V / V j 


U = A+TS = A- 


( dA \ - T 2 

r 9 



~d(A/T)~ 

\dT / jv,V 

_dT 

Kt) J 

N,V 

U(1/T)J 


(3) 

(4) 


(5) 


where the various symbols have their usual meanings. Comparing (5) with (2), we infer that 
there exists a close correspondence between the quantities entering through the statistical 
treatment and the ones coming from thermodynamics, namely 


ln|^exp(-^£ r )| = - 


A 

kT’ 


( 6 ) 


where lc is a universal constant yet to be determined; soon we shall see that k is indeed the 
Boltzmann constant. 

The equations in (6) constitute the most fundamental result of the canonical ensemble 
theory. Customarily, we write it in the form 


A(N, V, T) = -fcTln Q n (V, T), 


(7) 


where 


Qn(V, T) = J2 ex P(~Er/kT). 


(8) 
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The quantity Qn(V, T) is referred to as the partition function of the system; sometimes it 
is also called the “sum-over-states” (German: Zustandssumme). The dependence of Q on 
T is quite obvious. The dependence on N and V comes through the energy eigenvalues 
E r ; in fact, any other parameters that might govern the values E r should also appear in the 
argument of Q. Moreover, for the quantity A(N, V, T) to be an extensive property of the 
system, In Q must also be an extensive quantity. 

Once the Helmholtz free energy is known, the rest of the thermodynamic quantities 
follow straightforwardly. While the entropy, the pressure and the chemical potential are 
obtained from formulae (4), the specific heat at constant volume follows from 


Cy = 



dT 2 j 

/ N,V 


and the Gibbs free energy from 


G=A+PV=A-V 


\ =N (u 


dVJ 


N,T 


\dN 


= Nn; 


V,T 


(9) 


( 10 ) 


see Problem 3.5. 

At this stage it appears worthwhile to make a few remarks on the foregoing results. First 
of all, we note from equations (4) and (6) that the pressure P is given by 


E d ~w exp(-/S Et) 

r 

£exp(-/3 E r ) 


( 11 ) 


so that 


PdV = -J^PrdEr = -dU. (12) 

r 

The quantity on the right side of this equation is clearly the change in the average energy 
of a system (in the ensemble) during a process that alters the energy levels E r , leaving the 
probabilities P r unchanged. The left side then tells us that the volume change dV provides 
an example of such a process, and the pressure P is the “force” accompanying that process. 
The quantity P, which was introduced here through the thermodynamic relationship (3), 
thus acquires a mechanical meaning as well. 

The entropy of the system is determined as follows. Since P r — Q~ l exp(—/)£», 


(In P r ) = -InQ-p(Er) = P(A- U) = -S/k, 


with the result that 


S=-k{lnP r ) = -kJ^PrlnPr- 


(13) 
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This is an extremely interesting relationship, for it shows that the entropy of a physical sys¬ 
tem is solely and completely determined by the probability values P r (of the system being 
in different dynamical states accessible to it)! 

From the very look of it, equation (13) appears to be of fundamental importance; 
indeed, it reveals a number of interesting conclusions. One of these relates to a system 
in its ground state (T = OK). If the ground state is unique, then the system is sure to be 
found in this particular state and in no other; consequently, P r is equal to 1 for this state 
and 0 for all others. Equation (13) then tells us that the entropy of the system is precisely 
zero, which is essentially the content of the Nernst heat theorem or the third law of ther¬ 
modynamics. 1 We also infer that vanishing entropy and perfect statistical order (which 
implies complete predictability about the system) go together. As the number of acces¬ 
sible states increases, more and more of the P r become nonzero; the entropy of the system 
thereby increases. As the number of states becomes exceedingly large, most of the P- 
values become exceedingly small (and their logarithms assume large, negative values); the 
net result is that the entropy becomes exceedingly large. Thus, the largeness of entropy 
and the high degree of statistical disorder (or unpredictability) in the system also go 
hand in hand. 

It is because of this fundamental connection between entropy on one hand and lack of 
information on the other that equation (13) became the starting point of the pioneering 
work of Shannon (1948,1949) in the development of the theory of communication. 

It may be pointed out that formula (13) applies in the microcanonical ensemble as well. 
There, for each member system of the ensemble, we have a group of states, all equally 
likely to occur. The value of P r is, then, 1/ ^ for each of these states and 0 for all others. 
Consequently, 



(14) 


which is precisely the central result in the microcanonical ensemble theory; see equa¬ 
tion (1.2.6) or (2.3.6). 

3.4 Alternative expressions for the partition function 

In most physical cases the energy levels accessible to a system are degenerate, that is, one 
has a group of states, g ; in number, all belonging to the same energy value E t . In such cases 
it is more useful to write the partition function (3.3.8) as 


Qn(V, T ) = exp(-/?£,); 


( 1 ) 


7 Of course, if the ground state of the system is degenerate (with a multiplicity Qo), then the ground-state entropy is 
nonzero and is given by the expression k In Qo; see equation (14). 
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the corresponding expression for Pj, the probability that the system be in a state with 
energy Ei, would be 


p = gjexpi-pEj) 

1 Y.gi cx P<-P E i)' 

i 

Clearly, the g, states with a common energy Ej are all equally likely to occur. As a result, 
the probability of a system having energy Ej becomes proportional to the multiplicity g, of 
this level; g/ thus plays the role of a “weight factor” for the level E,. The actual probability is 
then determined by the weight factor g/ as well as by the Boltzmann factor exp(— pE{) of the 
level, as we have in (2) . The basic relations established in the preceding section, however, 
remain unaffected. 

Now, in view of the largeness of the number of particles constituting a given system and 
the largeness of the volume to which these particles are confined, the consecutive energy 
values Ej of the system are, in general, very close to one another. Accordingly, there lie, 
within any reasonable interval of energy (E,E+ dE), a very large number of energy levels. 
One may then regard £ as a continuous variable and write P(E)dE for the probability that 
the given system, as a member of the canonical ensemble, may have its energy in the range 
(E, E + dE). Clearly, this probability will be given by the product of the relevant single-state 
probability and the number of energy states lying in the specified range. Denoting the 
latter by g{E)dE, where g(E) denotes the density of states around the energy value E, we 
have 


P(E)dE cx exp(—f)E)g(E)dE 


(3) 


which, on normalization, becomes 

, exp (—BE)g(E)dE 

P(E)dE = ^ -- (■ 

/ exp {~pE)g(E)dE 
o 

The denominator here is yet another expression for the partition function of the system: 


OO 

Qn(V, T) — J e~P E g{E)dE. (5) 

o 

The expression for (/>, the expectation value of a physical quantity /, may now be 
written as 


</) = J2fi P i = 


Hm)gie P E i 


Lae 




oo 

Jf(E)e-t iE g(E)dE 

0 

oo 

/ e~P E g(E)dE 
o 


( 6 ) 
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Before proceeding further, we take a closer look at equation (5) and note that, with 
P > 0, the partition function Q(p) is just the Laplace transform of the density of states g(E). 
We may, therefore, write g(E) as the inverse Laplace transform of Q(fi): 

fi'+ioo 

g(E) — ./ . [ eP E Q(P)dp (Ji' > 0) 

Z7 XI J 

P'—io o 
oo 

= — f e^' +ip " )E Q(P' + ip")dp", 

2jc J 

—OO 

where p is now treated as a complex variable, p' + ip", while the path of integration 
runs parallel to, and to the right of, the imaginary axis, that is, along the straight line 
Re p = P' > 0. Of course, the path may be continuously deformed so long as the integral 
converges. 


(7) 

( 8 ) 


3.5 The classical systems 

The theory developed in the preceding sections is of very general applicability. It applies to 
systems in which quantum-mechanical effects are important as well as to those that can 
be treated classically. In the latter case, our formalism maybe written in the language of the 
phase space; as a result, the summations over quantum states get replaced by integrations 
over phase space. 

We recall the concepts developed in Sections 2.1 and 2.2, especially formula (2.1.3) for 
the ensemble average (/) of a physical quantity f(q,p), namely 

_ jf(q,p)p(q,p)d 3N qd 3N p 

[J> f p(q,p)d 3N qd 3N p ’ UJ 

where p(q, p) denotes the density of the representative points (of the systems) in the phase 
space; we have omitted here the explicit dependence of the function p on time t because 
we are interested in the study of equilibrium situations only. Evidently, the function p(q, p) 
is a measure of the probability of finding a representative point in the vicinity of the phase 
point ( q,p ), which in turn depends on the corresponding value H(q,p ) of the Hamiltonian 
of the system. In the canonical ensemble, 

p(q,p)cxexp[-pH(q,p)}-, (2) 

compare to equation (3.1.6). The expression for (/> then takes the form 

= ff(q,p)exp(-pH)da> 
f exp(—f}H)da> 


(3) 
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where dw(= d 3N qd 3N p) denotes a volume element of the phase space. The denomina¬ 
tor of this expression is directly related to the partition function of the system. However, 
to write the precise expression for the latter, we must take into account the relationship 
between a volume element in the phase space and the corresponding number of distinct 
quantum states of the system. This relationship was established in Sections 2.4 and 2.5, 
whereby an element of volume dco in the phase space corresponds to 


N\ h 3N 

distinct quantum states of the system. 8 The appropriate expression for the partition 
function would, therefore, be 


Qn(V,T) = -f e ~ mq ’ P)da)] (5) 

it is understood that the integration in (5) goes over the whole of the phase space. 

As our first application of this formulation, we consider the example of an ideal gas. 
Here, we have a system of N identical molecules, assumed to be monatomic (so there are 
no internal degrees of motion to be considered), confined to a space of volume V and in 
equilibrium at temperature T. Since there are no intermolecular interactions to be taken 
into account, the energy of the system is wholly kinetic: 

N 

H(q,p) = '^T{p 2 i /2m). (6) 

i=i 

The partition function of the system would then be 

i r N 

Qn{V ’ T) = mm f e ~ W2m) ^ Y\(d 3 qid 3 Pi ). ( 7 ) 

i= 1 

Integrations over the space coordinates are rather trivial; they yield a factor of V N . Integra¬ 
tions over the momentum coordinates are also quite easy, once we note that integral (7) is 
simply a product of N identical integrals. Thus, we get 


Qn(Y, T) = 


V N 


N\h 3N 


LXJ 

/' 


,—p 2 /2mkT 


(^4np 2 dp^j 


N 


1 

m 


mkT) 3/2 

h 6 


-i N 


( 8 ) 

(9) 


8 Ample justification has already been given for the factor h 3N . The factor N\ comes from the considerations of 
Sections 1.5 and 1.6; it arises essentially from the fact that the particles constituting the given system are not only 
identical but, in fact, indistinguishable. For a complete proof of this result, see Section 5.5. 
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here, use has been made of equation (B.13a). The Helmholtz free energy is then given by, 
using Stirling’s formula (B.29), 


A(N, V, T) = -kT\n Q N (V, T) = NkT 


In 


NI h 2 
V \ 2nmkT 



( 10 ) 


The foregoing result is identical to equation (1.5.8), which was obtained by following a very 
different procedure. The simplicity of the present approach is, however, striking. Needless 
to say, the complete thermodynamics of the ideal gas can be derived from equation (10) in 
a straightforward manner. For instance, 


-(a.H H^T\ 

/dA\ _ NkT 
\91Viv,r v 


and 


S 



= Nk 


In 


N,V 


V /2nmkT\ 3/2 } 5 

N V h 2 ) } + 2 


( 11 ) 

( 12 ) 


(13) 


These results are identical to the ones derived previously, namely (1.5.7), (1.4.2), and 
(1.5.1a), respectively. In fact, the identification of formula (12) with the ideal-gas law, 
PV — nRT, establishes the identity of the (hitherto undetermined) constant k as the 
Boltzmann constant ; see equation (3.3.6). We further obtain 


u= - 

4( ln Q) 

= -T 2 



_ d/3 

E r 

_dT \ T J _ 


= A+TS= -NkT, 


J N,V 


(14) 


and so on. 

At this stage we have an important remark to make. Looking at the form of equation (8) 
and the manner in which it came about, we may write 

Qn(V,T)=^[Q 1 (V,T)] n , (15) 

where Q] (V, T) may be regarded as the partition function of a single molecule in the sys¬ 
tem. A little reflection will show that this result obtains essentially from the fact that the 
basic constituents of our system are noninteracting (and hence the total energy of the 
system is simply the sum of their individual energies). Clearly, the situation will not be 
altered even if the molecules in the system had internal degrees of motion as well. What 
is essentially required for equation (15) to be valid is the absence of interactions among 
the basic constituents of the system (and, of course, the absence of quantum-mechanical 
correlations). 
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Going back to the ideal gas, we could as well have started with the density of states g(E). 
From equation (1.4.17), and in view of the Gibbs correction factor, we have 


S(E) = ™ « JL ( 2 *m) 3N/2 e (3N/2)—1 

6 dE N\ \h 3 J {(3A7/2) — 1}! 


(16) 


Substituting this into equation (3.4.5), and noting that the integral involved is equal to 
{(3AT/2) - 1}!//J 3]V / 2 , we obtain 


1 / V\ N (27tm\ 3N/2 

Qw( ^m(p) hr) ' 


(17) 


which is identical to (9). It may also be noted that if one starts with the single-particle 
density of states (2.4.7), namely 


aWK^em) 3 ^ 1 ' 2 , (18) 

computes the single-particle partition function, 

OO q 

f V /2nmY /2 

Qm = J e~^a(s)ds = ¥ l —) , (19) 

o 

and then makes use of formula (15), one would arrive at the same result for Qn(V, T). 

Lastly, we consider the question of determining the density of states, g(E), from 
the expression for the partition function, Q(/3) — assuming that the latter is already 
known; indeed, expression (9) for Q(/3) was derived without making use of any knowledge 
regarding the function g(E). According to equation (3.4.7) and (9), we have 


V N 

8(E) = N\ 


2n m\ 3N ' 2 
h 2 ) 


fi'+ioc 


- I 

Ini J 


JE 


2 n i J p 3N / 2 

fi'—io o 


(/S' > 0). 


Noting that, for all positive n, 


1 

2ni 


s'+zoo 

/ 



s'—zoo 



for x > 0 
for x < 0, 


( 20 ) 


( 21) 9 


equation (20) becomes 


g(E) = 


V N 

(2 Ttm 

N\ ' 

( h 2 


0 


3N/2 £(3iV/2)-l 

{(3N/2) - 1}! 


for E > 0 
for E < 0, 


( 22 ) 


9 For the details of this evaluation, see Kubo (1965, pp. 165-168). 
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which is indeed the correct result for the density of states of an ideal gas; compare to 
equation (16). The foregoing derivation may not appear particularly valuable because in 
the present case we already knew the expression for g{E). However, cases do arise where 
the evaluation of the partition function of a given system and the consequent evaluation of 
its density of states turn out to be quite simple, whereas a direct evaluation of the density 
of states from first principles is rather involved. In such cases, the method given here can 
indeed be useful; see, for example, Problem 3.15 in comparison with Problems 1.7 and 2.8. 


3.6 Energy fluctuations in the canonical ensemble: 
correspondence with the microcanonical 
ensemble 


In the canonical ensemble, a system can have energy anywhere between zero and infinity. 
On the other hand, the energy of a system in the microcanonical ensemble is restricted 
to a very narrow range. How, then, can we assert that the thermodynamic properties of a 
system derived through the formalism of the canonical ensemble would be the same as 
the ones derived through the formalism of the microcanonical ensemble? Of course, we 
do expect that the two formalisms yield identical results, for otherwise our whole scheme 
would be marred by internal inconsistency. And, indeed, in the case of an ideal classical 
gas the results obtained by following one approach were precisely the same as the ones 
obtained by following the other approach. What, then, is the underlying reason for this 
equivalence? 

The answer to this question is obtained by examining the extent of the range over which 
the energies of the systems in the canonical ensemble have a significant probability to 
spread; that will tell us the extent to which the canonical ensemble really differs from the 
microcanonical one. To explore this point, we write down the expression for the mean 
energy 


Y,E r exp(-pE r ) 



( 1 ) 


r 


and differentiate it with respect to the parameter ft, holding the energy values E, constant. 
We obtain 


2 


dP £exp (~pE r ) + 


£f?r exp(-fiEr) 
r 


J2E r exp(-^E r ) 


2 


r 


£exp( -pE r ) 


r 


(E 2 ) + (E) 2 , 


( 2 ) 


from which it follows that 



(3) 
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Note that we have here the specific heat at constant volume, because the partial differen¬ 
tiation in (2) was carried out with the E r kept constant! For the relative root-mean-square 
fluctuation in E, equation (3) gives 

VK(a E) 2 )} = V(fcr 2 Cy) 

(E) U 

which is 0(1V -1 / 2 ), N being the number of particles in the system. Consequently, for large 
N (which is true for every statistical system) the relative r.m.s. fluctuation in the values of E 
is quite negligible! Thus, for all practical purposes, a system in the canonical ensemble has 
an energy equal to, or almost equal to, the mean energy U ; the situation in this ensemble 
is, therefore, practically the same as in the microcanonical ensemble. That explains why 
the two ensembles lead to practically identical results. 

For further understanding of the situation, we consider the manner in which energy is 
distributed among the various members of the (canonical) ensemble. To do this, we treat 
E as a continuous variable and start with expression (3.4.3), namely 


P(E)dE oc exp(—jS E)g{E)dE. 


(3.4.3) 


The probability density P(E) is given by the product of two factors: (i) the Boltzmann factor, 
which monotonically decreases with E, and (ii) the density of states, which monotonically 
increases with E. The product, therefore, has an extremum at some value of E, say E*. w 
The value E* is determined by the condition 


TE ^ E m) 


= 0 , 


E=E* 


that is, by 


9 In g(E) 
dE 


= P- 

E=E* 


(5) 


Recalling that 


S = king and 


dS(E) \ 

dE )E=U 


1 

T 


= kp, 


the foregoing condition implies that 


E* = U. (6) 

This is a very interesting result, for it shows that, irrespective of the physical nature of 
the given system, the most probable value of its energy is identical to its mean value. 
Accordingly, if it is advantageous, we may use one instead of the other. 

'“Subsequently we shall see that this extremum is actually a maximum — and an extremely sharp one at that. 
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We now expand the logarithm of the probability density P(E) around the value E* U; 

we get 


In 




(E — Ur + ■ 


E=U 


= ~P(U— TS) — 


2kT 2 C\ 


-C E-UY + - 


(7) 


from which we obtain 


P(E) oce ftE g(E) ~ e ^ u TS) exp | - |- ( 8 > 

This is a Gaussian distribution in E, with mean value U and dispersion ^/(kT 2 Cv'y, compare 
with equation (3) . In terms of the reduced variable E/U, the distribution is again Gaussian, 
with mean value unity and dispersion ^/{kT 2 Cy)/U {which is 0(/V~ 1/2 )|; thus, for N » 1, 
we have an extremely sharp distribution which, as N -»• oo, approaches a delta-function! 

It would be instructive here to consider once again the case of a classical ideal gas. 
Here, g(E) is proportional to e & n / 2 ~ 1) and hence increases very fast with E; the factor 
e~P E , of course, decreases with E. The product g(E)exp(—/3E) exhibits a maximum at 
E* — (3AT/2 — l)/} -1 , which is practically the same as the mean value U — (3AT/2)/! -1 . For 
values of E significantly different from E*, the product essentially vanishes (for smaller val¬ 
ues of E, due to the relative paucity of the available energy states; for larger values of E, due 
to the relative depletion caused by the Boltzmann factor). The overall picture is shown in 
Figure 3.3 where we have displayed the actual behavior of these functions in the special 
case N — 10. The most probable value of E is now |4 of the mean value; so, the distribution 
is somewhat asymmetrical. The effective width A can be readily calculated from (3) and 
turns out to be (2/3 N) l ! 2 U, which, for N — 10, is about a quarter of U. We can see that, 
as N becomes large, both E* and U increase (essentially linearly with N), the ratio E*/U 
approaches unity and the distribution tends to become symmetrical about E*. At the same 
time, the width A increases (but only as iV 1 / 2 ); considered in the relative sense, it tends to 
vanish (as IV -1 / 2 ). 

We finally look at the partition function Qn(V, T), as given by equation (3.4.5), with its 
integrand replaced by (8). We have 


ou 

Q n (V, T)~e-^ U ~ TS) J e- (E - U)2 t 2kr2Cv dE 


OO 

~ e -UU-TS) j {2kT 2 Cv) J e -x 2 dx 

—OO 


= e-^ u - TS) ^(2nkT 2 C v ), 
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FIGURE 3.3 The actual behavior of the functions g(E), e P E , and g(E)e P E for an ideal gas, with N = 10. The 
numerical values of the functions have been expressed as fractions of their respective values at E = U. 


so that 


-kTlnQ N (V,T)=A~(U-TS)- hcT\n(2nkT 2 C v ). 


(9) 


The last term, being 0(ln iV), is negligible in comparison with the other terms, which are 
all O(N). Hence, 


A^U-TS. (10) 

Note that the quantity A in this formula has come through the formalism of the 
canonical ensemble, while the quantity S has come through a definition belonging to 
the microcanonical ensemble. The fact that we finally end up with a consistent thermo¬ 
dynamic relationship establishes beyond doubt that these two approaches are, for all 
practical purposes, identical. 

3.7 Two theorems — the "equipartition" 
and the "virial" 

To derive these theorems, we determine the expectation value of the quantity 
Xi(dH/dXj), where H{q,p) is the Hamiltonian of the system (assumed classical) while xt 
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and xj are any two of the 6N generalized coordinates (q, p). In the canonical ensemble, 


m\ f{ x t^) e pHda 


dXi 


f e~P H da> 


(dco — d 3N q d 3N p^J . 


( 1 ) 


Let us consider the integral in the numerator. Integrating over xj by parts, it becomes 


/ 



Xje 


-PH 


(Xj) 2 
(Xj) 1 




e P H dxj 


da>, 


0> 


here, (xj) \ and {xj) 2 are the “extreme” values of the coordinate Xj, while do) (J) denotes “dm 
devoid of dxj.” The integrated part here vanishes because whenever any of the coordinates 
takes an “extreme” value the Hamiltonian of the system becomes infinite. 11 In the integral 
that remains, the factor dXi/dXj, being equal to Sjj, comes out of the integral sign and we 
are left with 


j^ij J e p "da>. 

Substituting this into (1), we arrive at the remarkable result: 


dH 
3 Xj 


= SijkT, 


which is independent of the precise form of the function H. 
In the special case x,- = xj = pi, equation (2) takes the form 



(Pidi) = kT, 


( 2 ) 


(3) 


while for x,- = xj = qt, it becomes 



~{qiPi) = kT. 


(4) 


Adding over all i, from i = 1 Lo 3N, we obtain 



(5) 


n For instance, if xj is a space coordinate, then its extreme values will correspond to “locations at the walls of the con¬ 
tainer"; accordingly, the potential energy of the system would become infinite. If, on the other hand, Xj is a momentum 
coordinate, then its extreme values will themselves be ±oo, in which case the kinetic energy of the system would become 
infinite. 
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and 


Wij = 3NkT ■ (6) 

Now, in many physical situations the Hamiltonian of the system happens to be a quadratic 
function of its coordinates; so, through a canonical transformation, it can be brought into 
the form 


H =j2 A j p f+j: B jQj 

i j 


( 7 ) 


where Pj and Qj are the transformed, canonically conjugate, coordinates while Aj and Bj 
are certain constants of the problem. For such a system, we clearly have 



( 8 ) 


accordingly, by equations (3) and (4), 


(H) = \fkT, ( 9 ) 

where / is the number of nonvanishing coefficients in expression (7) . We, therefore, con¬ 
clude that each harmonic term in the (transformed) Hamiltonian makes a contribution 
of \IcT toward the internal energy of the system and, hence, a contribution of jk toward 
the specific heat Cy. This result embodies the classical theorem of equipartition of energy 
(among the various degrees of freedom of the system). It may be mentioned here that, 
for the distribution of kinetic energy alone, the equipartition theorem was first stated by 
Boltzmann (1871). 

In our subsequent study we shall find that the equipartition theorem as stated here is 
not always valid; it applies only when the relevant degrees of freedom can be freely excited. 
At a given temperature T, there may be certain degrees of freedom which, due to the insuf¬ 
ficiency of the energy available, are more or less “frozen” due to quantum mechanical 
effects. Such degrees of freedom do not make a significant contribution toward the inter¬ 
nal energy of the system or toward its specific heat; see, for example, Sections 6.5, 7.4, 
and 8.3. Of course, the higher the temperature of the system the better the validity of this 
theorem. 

We now consider the implications of formula (6). First of all, we note that this formula 
embodies the so-called virial theorem of Clausius (1870) for the quantity (J^itfiPi)’ which is 
the expectation value of the sum of the products of the coordinates of the various particles 
in the system and the respective forces acting on them; this quantity is generally referred to 
as the virial of the system and is denoted by the symbol V. The virial theorem then states 
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that 


V = —3NkT. (10) 

The relationship between the virial and other physical quantities of the system is best 
understood by first looking at a classical gas of noninteracting particles. In this case, the 
only forces that come into play are the ones arising from the walls of the container; these 
forces can be designated by an external pressure P that acts on the system by virtue of 
the fact that it is bounded by the walls of the container. Consequently, we have here a force 
—P dS associated with an element of area dS of the walls; the negative sign appears because 
the force is directed inward while the vector dS is directed outward. The virial of the gas is 
then given by 


T 0 = = -P<j>r-dS, (ll) 12 

\ i / o J s 

where r is the position vector of a particle that happens to be in the (close) vicinity of 
the surface element dS ; accordingly, r may be considered to be the position vector of the 
surface element itself. By the divergence theorem, equation (11) becomes 

V 0 = ~pJ (div r)dV = -3 PV. (12) 

v 

Comparing (12) with (10), we obtain the well-known result: 


PV = NkT. (13) 

The internal energy of the gas, which in this case is wholly kinetic, follows from the 
equipartition theorem (9) and is equal to | NkT, 3N being the number of degrees of 
freedom. Comparing this result with (10), we obtain the classical relationship 


V = -2 K, 


(14) 


where K denotes the average kinetic energy of the system. 

It is straightforward to apply this theorem to a system of particles interacting through 
a two-body potential u(r). In the thermodynamic limit, the pressure of a d-dimensional 
system depends only on the virial terms arising from the forces between pairs of particles: 



1 

NdkT 




= 1 - 


1 I du(rjj) \ 

NdkT \ti 7 


(15) 


12 It will be noted that the summation over the various particles of the system, which appears in the definition of the 
virial, has been replaced by an integration over the surface of the container, for the simple reason that no contribution 
to the virial arises from the interior of the container. 
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Equation (15) is called the virial equation of state. This equation can also be written in 
terms of the pair correlation function, equation (10.7.11), and is also used in computer 
simulations to determine the pressure of the system; see Problem 3.14, Section 10.7, and 
Section 16.4. 


3.8 A system of harmonic oscillators 

We shall now examine a system of N, practically independent, harmonic oscillators. This 
study will not only provide an interesting illustration of the canonical ensemble formu¬ 
lation but will also serve as a basis for some of our subsequent studies in this text. Two 
important problems in this line are (i) the theory of the black-body radiation (or the “sta¬ 
tistical mechanics of photons”) and (ii) the theory of lattice vibrations (or the “statistical 
mechanics of phonons”); see Sections 7.3 and 7.4 for details. 

We start with the specialized situation when the oscillators can be treated classically. 
The Hamiltonian of any one of them (assumed to be one-dimensional) is then given by 

H(qt,Pi)= ^m,(o 2 q 2 i + ^p? (i= l,...,N). (1) 

For the single-oscillator partition function, we readily obtain 

oo oo 

QM) = / / ° p (-f Q-»»v+ f /)) ^ 

—oo —oo 

_ 1 / 2?r \ 1/2 /2jrm\ 1/2 _ 1 _ kT 

h \fimco 2 ) \ fi ) f)ha> ha>’ 

where h— Ii/2jt. This represents a classical counting of the average number of accessible 
microstates — that is, kT divided by the quantum harmonic oscillator energy spacing. The 
partition function of the W-oscillator system would then be 

QN(P) = lQim N = (Phur N ={j-J ; ( 3 ) 


note that in writing (3) we have assumed the oscillators to be distinguishable. This is so 
because, as we shall see later, these oscillators are merely a representation of the energy 
levels available in the system; they are not particles (or even “quasiparticles”). It is actu¬ 
ally photons in one case and phonons in the other, which distribute themselves over the 
various oscillator levels, that are indistinguishable ! 

The Helmholtz free energy of the system is now given by 


A = -kT\nQ N = NkTln 



( 4 ) 
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whereby 


ll = kT,Dt 'vf)’ 


P= 0 , 

S = Nk 
U = NkT, 


<)+> 


(5) 

( 6 ) 

(7) 

( 8 ) 


and 


Cp = Cy = Nk. 


(9) 


We note that the mean energy per oscillator is in complete agreement with the equiparti- 
tion theorem, namely 2 x \kT, for we have here two independent quadratic terms in the 
single-oscillator Hamiltonian. 

We may determine the density of states, g(E), of this system from expression (3) for its 
partition function. We have, in view of (3.4.7), 


that is, 


g(E) = 


P'+ioo 

1 1 C P? E 


(ho) N 2n 


C 

l J W‘“ > 


fi'—ioo 


g(E) = 


1 


pJV-i 


for E > 0 


( hm) N (N-iy. 

0 for B< 0 . 


( 10 ) 


To test the correctness of (10), we may calculate the entropy of the system with the help of 
this formula. Taking N » 1 and making use of the Stirling approximation, we get 


S(N,E ) = k lng(F) ^ Nk 


In 


E ) + > 


Nhai 


(ID 


which gives for the temperature of the system 


dS\ 


-l 


T= — — ~r~rr • 


dEj 


N 


Nk' 


( 12 ) 


Eliminating E between these two relations, we obtain precisely our earlier result (7) for the 
function S(N, T). 
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We now take up the quantum-mechanical situation, according to which the energy 
eigenvalues of a one-dimensional harmonic oscillator are given by 


£n — 


[n+^jhw, n = 0,1,2,... 
Accordingly, we have for the single-oscillator partition function 

exp I 

Q x (P) = ^2 e"^ ( " +1/2)fi ® 

72=0 

= |2sinh hco 

The Af-oscillator partition function is then given by 



Qn(P) = [Qi(^)] w = 


2sinh( -phco 


-N 


_ i _ e -piw>\-N 


For the Helmholtz free energy of the system, we get 

'1 


A = NkT In 


2 sinh ( - p hco 


= N 


^hco + kT\n{l-e~P ha> } 


whereby 


fi — A/N, 
P= 0, 

S = Nk 


1 


1 


pha >coth — fiha) 1 — In 12sinh -fihai 


1 


= Nk 


/3ha> 

gfihco _ J 


- lnjl - e 


1 


—f5hco\ 


and 


U = -Nheucoth -phco ) = N 


Cp = Cy = Nk\-fihui) cosech \-fiha) 


1 hco 

— hco -\—- 

2 _ i 


1 


= Nk(fihco) 2 


pfihco 


(gfihco _ ]\2 * 


(13) 


(14) 


(15) 


(16) 


(17) 

(18) 


(19) 

( 20 ) 


( 21 ) 


Formula (20) is especially significant, for it shows that the quantum-mechanical oscil¬ 
lators do not obey the equipartition theorem. The mean energy per oscillator is different 
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FIGURE 3.4 The mean energy (e) of a simple harmonic oscillator as a function of temperature. 1, the Planck 
oscillator; 2, the Schrodinger oscillator; and 3, the classical oscillator. 


from the equipartition value kT ; actually, it is always greater than kT; see curve 2 in 
Figure 3.4. Only in the limit of high temperatures, where the thermal energy kT is much 
larger than the energy quantum hco, does the mean energy per oscillator tend to the 
equipartition value. It should be noted here that if the zero-point energy ^hco were not 
present, the limiting value of the mean energy would be (kT - \hm), and not kT — we 
may call such an oscillator the Planck oscillator, see curve 1 in Figure 3.4. In passing, we 
observe that the specific heat (21), which is the same for the Planck oscillator as for the 
Schrodinger oscillator, is temperature-dependent; moreover, it is always less than, and at 
high temperatures tends to, the classical value (9). 

Indeed, for kT^>hu>, formulae (14) through (21) go over to their classical counterparts, 
namely (2) through (9), respectively. 

We shall now determine the density of states g(E) of the N-oscillator system from its 
partition function (15). Carrying out the binomial expansion of this expression, we have 


R =0 


Q N (P) = Y, I + R ~ \ e-^ Nhm+Rha>) 


R 


( 22 ) 


Comparing this with the formula 


CXJ 

Qn(P) = J g(E)e~P E dE, 


we conclude that 


R =0 


R 


g(E) = Y2( N + ~ 


(23) 
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where <5(x) denotes the Dirac delta function. Equation (23) implies that there are 
(N + R — 1)! /R\ (N - 1)! microstates available to the system when its energy E has the dis¬ 
crete value ( R+ \N)hco , where R— 0,1,2,..., and that no microstate is available for other 
values of E. This is hardly surprising, but it is instructive to look at this result from a slightly 
different point of view. 

We consider the following problem that arises naturally in the microcanonical ensem¬ 
ble theory. Given an energy E for distribution among a set of N harmonic oscillators, each 
of which can be in any one of the eigenstates (13), what is the total number of distinct ways 
in which the process of distribution can be carried out? Now, in view of the form of the 
eigenvalues e n , it makes sense to give away, right in the beginning, the zero-point energy 
\hci> to each of the N oscillators and convert the rest of it into quanta (of energy hco). Let R 
be the number of these quanta; then 


R = 



-Nhco 

2 



ha>. 


(24) 


Clearly, R must be an integer; by implication, E must be of the form (R + \N)hco . The prob¬ 
lem then reduces to determining the number of distinct ways of allotting R quanta to N 
oscillators, such that an oscillator may have 0 or 1 or 2... quanta; in other words, we have 
to determine the number of distinct ways of putting R indistinguishable balls into N dis¬ 
tinguishable boxes, such that a box may receive 0 or 1 or 2... balls. A little reflection will 
show that this is precisely the number of permutations that can be realized by shuffling R 
balls, placed along a row, with (N — 1) partitioning lines (that divide the given space into N 
boxes); see Figure 3.5. The answer clearly is 


(R + N — 1)! 
R\(N — 1)! ’ 


(25) 


which agrees with (23). 

We can now determine the entropy of the system from the number (25). Since N » 1, 
we have 


S sb lc{ln(R + N)\ - Inf?! — In AT!) 

sb lc{(R +N)ln(R +N) — R\nR — AflnAT); (26) 


FIGURE 3.5 Distributing 17 indistinguishable balls among 7 distinguishable boxes. The arrangement shown here 
represents one of the 231/1716! distinct ways of carrying out the distribution. 
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the number R is, of course, a measure of the energy E of the system; see (24). For the 
temperature of the system, we obtain 


1_/9S\ -f dS ) 1 - k ln( R + N \ 
T \dEj N \dRj N ha> hm \ R ) 


k 1 /E+^Nhcv 
hco \£- \Nhw 


(27) 


so that 


E 1 exp(ha>/kT) +1 
N 2 M exp(hco/kT) — 1 ’ 


(28) 


which is identical to (20) . It can be further checked that, by eliminating R between (26) and 
(27), we obtain precisely the formula (19) for S(N, T ). Thus, once again, we find that the 
results obtained by following the microcanonical approach and the canonical approach 
are the same in the thermodynamic limit. 

Finally, we may consider the classical limit when E/N, the mean energy per oscillator, 
is much larger than the energy quantum hco, that is, when R^> N. The expression (25) may, 
in that case, be replaced by 


(R + N- 1)(R + N-2)...(R+1) _ R 1 *- 1 
(IV-1)! ^ (IV-1)!’ 


(25a) 


with 


R sb E/hcu. 

The corresponding expression for the entropy turns out to be 

E 


S «b k{N\n(R/N) + N] sa Nlc In 


Nhw 


+ 1 , 


which gives 


so that 


1 /9S\ _ Nk 

T~{dEj N ’ 


— « kT. 
N 


(26a) 


(27a) 


(28a) 


These results are identical to the ones derived in the classical limit earlier in this section. 


3.9 The statistics of paramagnetism 

Next, we study a system of N magnetic dipoles, each having a magnetic moment fi. In the 
presence of an external magnetic field H, the dipoles will experience a torque tending to 
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align them in the direction of the field. If there were nothing else to check this tendency, 
the dipoles would align themselves precisely in this direction and we would achieve 
a complete magnetization of the system. In reality, however, thermal agitation in the 
system offers resistance to this tendency and, in equilibrium, we obtain only a partial 
magnetization. Clearly, as T -*■ OK, the thermal agitation becomes ineffective and the 
system exhibits a complete orientation of the dipole moments, whatever the strength 
of the applied field; at the other extreme, as T -»• oo, we approach a state of complete 
randomization of the dipole moments, which implies a vanishing magnetization. At 
intermediate temperatures, the situation is governed by the parameter (pH/kT). 

The model adopted for this study consists of N identical, localized (and, hence, dis¬ 
tinguishable), practically static, mutually noninteracting and freely orientable dipoles. We 
consider first the case of classical dipoles that can be oriented in any direction relative to 
the applied magnetic field. It is obvious that the only energy we need to consider here is 
the potential energy of the dipoles that arises from the presence of the external field H and 
is determined by the orientations of the dipoles with respect to the direction of the field: 

N N N 

E=Y, E i = ~Yl IL i H =- llH 12 COS0/. (1) 

i= 1 i= 1 i= 1 

The partition function of the system is then given by 

Qn(P) = [Qi(/6)] w , (2) 


where 


Qi(P) = exp(f) pH cos 9). 

g 


(3) 


The mean magnetic moment M of the system will obviously be in the direction of the field 
H\ for its magnitude we shall have 


P cos 9 exp(PpH cos 9) 


M z = N{pcos6) — N 


J^exp(ppH cos 9) 

9 


N d 
JdH 


In Q\ (P) = 



(4) 


Thus, to determine the degree of magnetization in the system all we have to do is to 
evaluate the single-dipole partition function (3). 

First, we proceed classically (after Langevin, 1905a,b). Using (sin OdOdcp) as the elemen¬ 
tal solid angle representing a small range of orientations of the dipole, we get 


2n n 

Qi(P) = f [ sineded( p - si nh(/3/xU) , 

J J PpH 

o o 


( 5 ) 




72 Chapter 3 • The Canonical Ensemble 


so that 


V-z — = M |coth(/3/xH) - J = pL(PpH), (6) 

where L(x) is the so-called Langevin function 

L{ x) — cothx— (7) 

x 

a plot of the Langevin function is shown in Figure 3.6. We note that the parameter fpH 
denotes the strength of the (magnetic) potential energy pH compared to the (thermal) 
kinetic energy kT. 

If we have No dipoles per unit volume in the system, then the magnetization of the 
system, namely the mean magnetic moment per unit volume, is given by 


M z0 = N 0 p z = N 0 pL(x) (x = PuH). (8) 

For magnetic fields so strong (or temperatures so low) that the parameter x^> 1, the 
function L(x) is almost equal to 1; the system then acquires a state of magnetic saturation: 

P z -P and M z0 ~N 0 p. (9) 

For temperatures so high (or magnetic fields so weak) that the parameter r«l, the 
function L(x) may be written as 


x 

3 



which, in the lowest approximation, gives 


Mzo — 


N 0 p 2 

3kT 


H. 


( 10 ) 


( 11 ) 



FIGURE 3.6 The Langevin function L(x). 
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The high-temperature isothermal susceptibility of the system is, therefore, given by 


X T = Lim 

T H^O 


dMzo\ 
d H ) T 


Nqu 2 
3 lcT 


C 

j’ say. 


( 12 ) 


Equation (12) is the Curie law of paramagnetism, the parameter C being the Curie constant 
of the system. Figure 3.7 shows a plot of the susceptibility of a powdered sample of copper- 
potassium sulphate hexahydrate as a function of T -1 ; the fact that the plot is linear and 
passes almost through the origin vindicates the Curie law for this particular salt. 

We shall now treat the problem of paramagnetism quantum-mechanically. The major 
modification here arises from the fact that the magnetic dipole moment // and its compo¬ 
nent ijl z in the direction of the applied field cannot have arbitrary values. Quite generally, 
we have a direct relationship between the magnetic moment /r of a given dipole and its 
angular momentum l: 





(13) 


with 

Z 2 =/(/+l)ft 2 ; /= or 0,1,2,... (14) 

The quantity g(e/2mc) is the gyromagnetic ratio of the dipole while the number g is Lande’s 
g-factor. If the net angular momentum of the dipole is due solely to electron spins, then 



FIGURE 3.7 X versus 1/T plot for a powdered sample of copper-potassium sulphate hexahydrate (after Hupse, 
1942). 
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g — 2; on the other hand, if it is due solely to orbital motions, then g= 1. In general, 
however, its origin is mixed; g is then given by the formula 


_ 3 S(S+ 1) — L(L + 1) 

8 ~ 2 + 27 ( 7 + 1 ) 


(15) 


S and L being, respectively, the spin and the orbital quantum numbers of the dipole. Note 
that there is no upper or lower bound on the values that g can have! 

Combining (13) and (14), we can write 


M 2 — g 2 +§/(/ + 1)> 


(16) 


where pb(— eh/2mc) is the Bohr magneton. The component p z of the magnetic moment 
in the direction of the applied field is, on the other hand, given by 


^z = gf-B m ’ rn = -],-J+l,...,J-l,J. (17) 

Thus, a dipole whose magnetic moment p conforms to expression (16) can have no other 
orientations with respect to the applied field except the ones conforming to the values (17) 
of the component p z ; obviously, the number of allowed orientations, for a given value of 
/, is (2/+ 1). In view of this, the single-dipole partition function Qi(/8) is now given by, 
see (3), 


i 

Qt(yS) = ex P(18) 

m=—J 

Introducing a parameter x, defined by 


x = Pig 


(19) 


equation (18) becomes 


mx/ , e -X{ e (2J+l) x /J _ 1} 

Q l( «= E emx/J = e x „_ l — 1 

m=—J 

g(2J-\-l)x/2J _ g-(2J-\-\)x/2J 

— e x/2J _ q-x/2J 


= sinh {| 1 + — 


7H/ 


sinh | —x 


The mean magnetic moment of the system is then given by, see equation (4), 


( 20 ) 


M z = N/i z = j E In Qi (fi) 


= N(gn H J) 


1 + — | cothI (1 + — | xl —coth | — x 
2 J V 2 j) 2/ 2/ 


( 21 ) 
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Thus 


U z = (g/V )Bj(x), (22) 

where Bj(x) is the Brillouin function of order /: 

B ;(X)=(l+ 1) coth j (l + *} - Yj c0th i h X \ ■ (23) 

In Figure 3.8 we have plotted the function Bj(x ) for some typical values of the quantum 
number/. 

We shall now consider a few special cases. First of all, we note that for strong fields and 
low temperatures (jc » 1), the function B/(x) ~ 1 for all /, which corresponds to a state of 
magnetic saturation. On the other hand, for high temperatures and weak fields (x <$; 1), the 
function Bj(x) may be written as 


so that 


i(l + l/7)x+..., 


- _ (gMjj/) 2 

~ 3 kT 



3 kT 


H. 


(24) 


(25) 


The Curie law, x oc 1/T, is again obeyed; however, the Curie constant is now given by 

„ Nog^lhJ+D Non 2 

Cj = - 3 k - = (26) 

see equation (16). It is indeed interesting that the high-temperature results, (25) and (26), 
directly involve the eigenvalues of the operator n 2 . 



FIGURE 3.8 The Brillouin function Bj(x) for various values of /. 
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We now look a little more closely at the dependence of the foregoing results on the 
quantum number/. First of all, we consider the extreme case / -*■ oo, with the understand¬ 
ing that simultaneously g -»■ 0, such that the value of /i stays constant. From equation (23), 
we readily observe that, in this limit, the Brillouin function Bj{x) tends to become (i) inde¬ 
pendent of/ and (ii) identical to the Langevin function L(x). This is not surprising because, 
in this limit, the number of allowed orientations for a magnetic dipole becomes infinitely 
large, with the result that the problem essentially reduces to its classical counterpart 
(where one must allow all possible orientations). At the other extreme, we have the case 
J =\, which allows only two orientations. The results in this case are very different from 
the ones for / » 1. We now have, with g — 2, 


— I^ b Bi/ 2 (x) = /r B tanhx. (27) 

For x » 1, p z is very nearly equal to p B . For x <$; 1, however, ji z — p B x, which corresponds 
to the Curie constant 


Cl/2 = 


N 0 /i 2 b 

k 


( 28 ) 


In Figure 3.9 we reproduce the experimental values of ~jl z (in terms of /i/j) as a function 
of the quantity H/T, for three paramagnetic salts; the corresponding theoretical plots, 
namely the curves gJBj[x), are also included in the figure. The agreement between theory 
and experiment is indeed good. In passing, we note that, at a temperature of 1.3 K, a field 
of about 50,000 gauss is sufficient to produce over 99 percent of saturation in these salts. 



FIGURE 3.9 Plots of JC z //j,b as a function of H/T. The solid curves represent the theoretical results, while the points 
mark the experimental findings of Henry (1952). Curve I is for potassium chromium alum (J = |,g = 2), curve II for 
iron ammonia alum (/ = |,g = 2), and curve III for gadolinium sulphate octahydrate (/ = j,g= 2). 
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3.10 Thermodynamics of magnetic systems: 
negative temperatures 

For the purpose of this section, it will suffice to consider a system of dipoles with J =\- 
Each dipole then has a choice of two orientations, the corresponding energies being -hbH 
and T/rg/f; let us call these energies —e and +e, respectively. The partition function of the 
system is then given by 

Qn 06) = + e~P E J = {2 coshC/Se)} 7 ^; (1) 

compare to the general expression (3.9.20). Accordingly, the Helmholtz free energy of the 
system is given by 


A= —NkTln{2cosh(s/kT)}, 


( 2 ) 


from which 


and, finally, 




[H 

U=A+TS=-Ne tanh(^), 
) r = iVw? tanh(^) 


(£)!■ 


2cosh(-H — ^tanh(^)], 


M = -[^| = 


3 H, 


c =(jT)„= Nt (m) 2 ’ xct? (m)- 


(3) 

(4) 

(5) 

( 6 ) 


Equation (5) is essentially the same as (3.9.27); moreover, as expected, U — —MH. 

The temperature dependence of the quantities S, U, M, and C is shown in Figures 3.10 
through 3.13. We note that the entropy of the system is vanishingly small for kT <<c e; it rises 
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FIGURE 3.10 The entropy of a system of magnetic dipoles (with / = |) as a function of temperature. 
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FIGURE 3.11 The energy of a system of magnetic dipoles (with J =\) as a function of temperature. 



FIGURE 3.12 The magnetization of a system of magnetic dipoles (with / = |) as a function of temperature. 


rapidly when kT is of the order of e and approaches the limiting value Mdn2 for kT e. 
This limiting value of S corresponds to the fact that at high temperatures the orientation 
of the dipoles assumes a completely random character, with the result that the system 
now has 2 N equally likely microstates available to it. The energy of the system attains 
its lowest value, -Ne, as OK; this clearly corresponds to a state of magnetic satura¬ 
tion and, hence, to a state of perfect order in the system. Toward high temperatures, the 
energy tends to vanish, 13 implying a purely random orientation of the dipoles and hence 

13 Note that in the present study we are completely disregarding the kinetic energy of the dipoles. 
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FIGURE 3.13 The specific heat of a system of magnetic dipoles (with / = j) as a function of temperature. 

a complete loss of magnetic order. These features are re-emphasized in Figure 3.12, which 
depicts the temperature dependence of the magnetization M. The specific heat of the sys¬ 
tem is vanishingly small at low temperatures but, in view of the fact that the energy of the 
system tends to a constant value as T -* oo, the specific heat vanishes at high tempera¬ 
tures as well. Somewhere around T — s/k, it displays a maximum. Writing A for the energy 
difference between the two allowed states of the dipole, the formula for the specific heat 
can be written as 



(7) 


A specific heat peak of this form is generally known as the Schottky anomaly; it is observed 
in systems that have an excitation gap A above the ground state. 

Now, throughout our study so far we have considered only those cases for which T > 0. 
For normal systems, this is indeed essential, for otherwise we have to contend with canon¬ 
ical distributions that blow up as the energy of the system is indefinitely increased. If, 
however, the energy of a system is bounded from above, then there is no compelling reason 
to exclude the possibility of negative temperatures. Such specialized situations do indeed 
exist, and the system of magnetic dipoles provides a good example thereof. From equa¬ 
tion (4), we note that, so long as U < 0, T > 0 — and that is the only range we covered in 
Figures 3.10 through 3.13. However, the same equation tells us that if U > 0 then T < 0, 
which prompts us to examine the matter a little more closely. For this, we consider the 
variation of the temperature T and the entropy S with energy U, namely 



( 8 ) 
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FIGURE 3.14 The entropy of a system of magnetic dipoles (with / = |) as a function of energy. Some values of the 
parameter kT/s are also shown in the figure. The slope at the two endpoints diverges since both ends represent 
zero temperature but it is difficult to see due to the logarithmic nature of the divergence. 


and 


S Ne+U, (Ne + U\ Ne-U, (Ns — U\ 

—- =-In --In - 1; (9) 

Nk 2 Ne V 2Ns ) V 2 Ns ) 

these expressions follow straightforwardly from equations (3) and (4), and are shown 
graphically in Figures 3.14 and 3.15. We note that for U = —Ne, both S and T vanish. 
As U increases, they too increase until we reach the special situation where U — 0. The 
entropy is then seen to have attained its maximum value M;ln2, while the temperature has 
reached infinity. Throughout this range, the entropy had been a monotonically increasing 
function of energy, so T was positive. Now, as U becomes 0+, ( dS/dU ) becomes 0_ and 
T becomes — oo. With a further increase in U, the entropy monotonically decreases; as a 
result, the temperature continues to be negative, though its magnitude steadily decreases. 
Finally, we reach the largest value of U, namely +Ne, where the entropy is once again zero 
and T = 0_. 

The region where U > 0 (and hence T < 0) is indeed abnormal because it corresponds 
to a magnetization opposite in direction to that of the applied field. Nevertheless, it can be 
realized experimentally in the system of nuclear moments of a crystal in which the relax¬ 
ation time t\ for mutual interaction among nuclear spins is very small in comparison with 
the relaxation time t 2 for interaction between the spins and the lattice. Let such a crystal 
be magnetized in a strong magnetic field and then the field reversed so quickly that the 
spins are unable to follow the switch-over. This will leave the system in a nonequilibrium 
state, with energy higher than the new equilibrium value U. During a period of order t \, 
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FIGURE 3.15 The temperature parameter kT/e, and its reciprocal fie, for a system of magnetic dipoles (with / = \) 
as a function of energy. 


the subsystem of the nuclear spins should be able to attain a state of internal equilibrium; 
this state will have a negative magnetization and will, therefore, correspond to a negative 
temperature. The subsystem of the lattice, which involves energy parameters that are in 
principle unbounded, will still be at a positive temperature. During a period of order t 2 , 
the two subsystems would attain a state of mutual equilibrium, which again will have a 
positive temperature. 14 An experiment of this kind was successfully performed by Purcell 
and Pound (1951) with a crystal of LiF; in this case, t\ was of order 10 -5 sec while I 2 was of 
order 5 min. A state of negative temperature for the subsystem of spins was indeed attained 
and was found to persist for a period of several minutes; see Figure 3.16. 

Before we close this discussion, a few general remarks seem in order. First of all, we 
should note that the onset of negative temperatures is possible only if there exists an upper 
limit on the energy of the given system. In most physical systems this is not the case, 
simply because most physical systems possess kinetic energy of motion which is obvi¬ 
ously unbounded. By the same token, the onset of positive temperatures is related to the 

14 Note that in the latter process, during which the spins realign themselves (now more favorably in the new direction 
of the field), the energy will flow from the subsystem of the spins to that of the lattice, and not vice versa. This is in perfect 
agreement with the fact that negative temperatures are hotter than positive ones; see the subsequent discussion in the 
text. 
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FIGURE 3.16 Atypical record of the reversed nuclear magnetization (after Purcell and Pound, 1951). On the left we 
have a deflection corresponding to normal, equilibrium magnetization (T ~ 300K); it is followed by the reversed 
deflection (corresponding to T ~ —350K), which decays through zero deflection (corresponding to a passage from 
T = -oo to T = +oo) toward the new equilibrium state that again has a positive T. 


existence of a lower limit on the energy of a system; this, however, does not present any 
problem because, if nothing else, the uncertainty principle alone is sufficient to set such 
a limit for every physical system. Thus, it is quite normal for a system to be at a positive 
temperature whereas it is very unusual for one to be at a negative temperature. 

Now, suppose that we have a system whose energy cannot assume unlimited high 
values. Then, we can surely visualize a temperature T such that the quantity NkT is 
much larger than any admissible value, E r , of the energy. At such a high temperature, the 
mutual interactions of the microscopic entities constituting the system may be regarded 
as negligible; accordingly, one may write for the partition function of the system 


Qn(i6) 



Since, by assumption, all /5s n « 1, we have 


Qn(i 3) 



( 10 ) 


( 11 ) 


Let g denote the number of possible orientations of a microscopic constituent of the sys¬ 
tem with respect to the direction of the external field; then, the quantities J2 n £ n( a = 0,1,2) 
may be replaced by ge“. We thus get 


InQjv(^) — N 


lng + ln^l - f}e+ ^/f 2 g 2 
lng —/Sg + ^ 2 (e 2 -e 2 ) 


-AT 


( 12 ) 
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The Helmholtz free energy of the system is then given by 


A(N,0)~ - — Ing + iVe- -f(s-e) 2 , 
P 2 


(13) 


from which 


NJc 

S(N, fi) ~ Nk lng — — f 2 (e-e) 2 , 

(14) 

U(N,f) ~ Ns — Nf(e — e) 2 , 

(15) 


and 


C(N,P) ~ Nkfi 2 (e — e) 2 . (16) 15 

The formulae in equations (12) through (16) determine the thermodynamic properties of 
the system for fi ~ 0. The important thing to note here is that they do so not only for fi > 0 
but also for fi < 0. In fact, these formulae hold in the vicinity of, and on both sides of, the 
maximum in the S—U curve; see Figure 3.14. Quite expectedly, the maximum value of S is 
given by Aiding, and it occurs at fi = ±0; S here decreases both ways, whether U decreases 
(fi > 0) or increases (fi < 0). It will be noted that the specific heat of the system in either 
case is positive. 

It is not difficult to show that if two systems, characterized by the temperature parame¬ 
ters /Si and /S 2 , are brought into thermal contact, then energy will flow from the system with 
the smaller value of fi to the system with the larger value of fi] this will continue until the 
two systems acquire a common value of this parameter. What is more important to note is 
that this result remains literally true even if one or both of the fi are negative. Thus, if Pi is 
—ve while fi 2 is +ve, then energy will flow from system 1 to system 2, that is, from the sys¬ 
tem at negative temperature to the one at positive temperature. In this sense, systems at 
negative temperatures are hotter than the ones at positive temperatures; indeed, negative 
temperatures are above +oo, not below zero! 

For further discussion of this topic, reference may be made to a paper by Ramsey (1956). 


Problems 

3.1. (a) Derive formula (3.2.36) from equations (3.2.14) and (3.2.35). 

(b) Derive formulae (3.2.39) and (3.2.40) from equations (3.2.37) and (3.2.38). 

3.2. Prove that the quantity g"(x 0 ), see equations (3.2.25), is equal to {(E — U ) 2 ) exp(2/i). Thus show that 
equation (3.2.28) is physically equivalent to equation (3.6.9). 

3.3. Using the fact that (1/n!) is the coefficient ofx n in the power expansion of the function exp(x), 
derive an asymptotic formula for this coefficient by the method of saddle-point integration. 
Compare your result with the Stirling formula for n\. 

15 Compare this result with equation (3.6.3). 






84 Chapter 3 • The Canonical Ensemble 


3.4. Verify that the quantity (fc/«/V) In r, where 

rCAf, U) = J2'w{nA, 

(«r) 

is equal to the (mean) entropy of the given system. Show that this leads to essentially the same 
result for In r if we take, in the foregoing summation, only the largest term of the sum, namely the 
term W{n*} that corresponds to the most probable distribution set. 

[Surprised? Well, note the following example: 


For all IV, the summation over the binomial coefficients N C r =N\/[r\(N — r!)] gives 


N 


J2 N Cr = 2 n ; 

r=0 


therefore, 


N C r | =JVln2. (a) 

Now, the largest term in this sum corresponds to r ~ N/ 2; so, for large N, the logarithm of the 
largest term is very nearly equal to 



ln{Ai!) — 21n{(lV/2)!) 

N N 

^N\nN-2-ln—=Nln2, (b) 

2 2 


which agrees with (a).] 

3.5. Making use of the fact that the Helmholtz free energy A(N, V, T) of a thermodynamic system is an 
extensive property of the system, show that 



V,T 



3A \ 

w) n ,t 


=A. 


[Note that this result implies the well-known relationship: Np = A + PV(= G).] 

3.6. (a) Assuming that the total number of microstates accessible to a given statistical system is Q, 

show that the entropy of the system, as given by equation (3.3.13), is maximum when ail Q 
states are equally likely to occur. 

(b) If, on the other hand, we have an ensemble of systems sharing energy (with mean value E), 
then show that the entropy, as given by the same formal expression, is maximum when 

P r a exp (—pE r ), f> being a constant to be determined by the given value of E. 

(c) Further, if we have an ensemble of systems sharing energy (with mean value E) and also 
sharing particles (with mean value IV), then show that the entropy, given by a similar 
expression, is maximum when P r , s oc exp(—aN r — pE s ), a and fi being constants to be 
determined by the given values of N and E. 

3.7. Prove that, quite generally 


Cp — Cv = —k 


[ar| 

T I 

( dlnQ 
V 

)j 

l 


/ 3 2 In Q \ 

T 


> 0 . 


Verify that the value of this quantity for a classical ideal classical gas is Nk. 
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3.8. Show that, for a classical ideal gas, 



+ T 


31nQA 
3T ) P 


3.9. If an ideal monatomic gas is expanded adiabatically to twice its initial volume, what will the ratio 
of the final pressure to the initial pressure be? If during the process some heat is added to the 
system, will the final pressure be higher or lower than in the preceding case? Support your answer 
by deriving the relevant formula for the ratio Pf/Pj. 

3.10. (a) The volume of a sample of helium gas is increased by withdrawing the piston of the containing 

cylinder. The final pressure Pi is found to be equal to the initial pressure Pj times (V,/Vj-) L2 , Vj 
and Vf being the initial and final volumes. Assuming that the product PV is always equal to 
| U, will (i) the energy and (ii) the entropy of the gas increase, remain constant, or decrease 
during the process? 

(b) If the process were reversible, how much work would be done and how much heat would be 
added in doubling the volume of the gas? Take Pj = 1 atm and Vj = 1 m 3 . 

3.11. Determine the work done on a gas and the amount of heat absorbed by it during a compression 
from volume Vi to volume V 2 , following the lawPV" = const. 

3.12. If the “free volume” V of a classical system is defined by the equation 



N 

e {U-U(q,))/kTy[ d 3 qu 
i= 1 


where U is the average potential energy of the system and U(qj) the actual potential energy as a 
function of the molecular configuration, then show that 


S = Nk 


In 


V /2nmkT\ 3/2 ] 

N\ h 2 ) } 


5 

2 


In what sense is it justified to refer to the quantity V as the “free volume” of the system? 
Substantiate your answer by considering a particular case — for example, the case of a hard sphere 
gas. 

3.13. (a) Evaluate the partition function and the major thermodynamic properties of an ideal gas 

consisting of Ni molecules of mass m\ and N 2 molecules of mass m 2 , confined to a space 
of volume V at temperature T. Assume that the molecules of a given kind are mutually 
indistinguishable, while those of one kind are distinguishable from those of the other kind. 

(b) Compare your results with the ones pertaining to an ideal gas consisting of (N\ + N 2 ) 
molecules, all of one kind, of mass m, such that m(Ni + N 2 ) = m\N\+ m 2 N 2 . 

3.14. Consider a system of N classical particles with mass m moving in a cubic box with volume V = L 3 . 
The particles interact via a short-ranged pair potential M(r,y) and each particle interacts with each 
wall with a short-ranged interaction MwaiiCz), where z is the perpendicular distance of a particle 
from the wall. Write down the Lagrangian for this model and use a Legendre transformation to 
determine the Hamiltonian H. 

(a) Show that the quantity P = — (j ^ j can clearly be identified as the instantaneous 

pressure — that is, the force per unit area on the walls. 

(b) Reconstruct the Lagrangian in terms of the relative locations of the particles inside the box 
r, = Lsj, where the variables s, all lie inside a unit cube. Use a Legendre transformation to 
determine the Hamiltonian with this set of variables. 

(c) Recalculate the pressure using the second version of the Hamiltonian. Show that the pressure 
now includes three contributions: 

(1) a contribution proportional to the kinetic energy, 

(2) a contribution related to the forces between pairs of particles, and 

(3) a contribution related to the force on the wall. 
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Show that in the thermodynamic limit the third contribution is negligible compared to the other 
two. Interpret contributions 1 and 2 and compare to the virial equation of state (3.7.15). 

3.15. Show that the partition function Qn(V, T) of an extreme relativistic gas consisting of TV monatomic 
molecules with energy-momentum relationship e = pc, c being the speed of light, is given by 


Qn(V,T) = 


1 

iV! 



N 


Study the thermodynamics of this system, checking in particular that 


PV=^U, U/N = 3kT, and y= 

Next, using the inversion formula (3.4.7) , derive an expression for the density of states g(E) of this 
system. 

3.16. Consider a system similar to the one in the preceding problem but consisting of 3TV particles 
moving in one dimension. Show that the partition function in this case is given by 


Q3n(L, T) = 


(3 TV)! 


2 L 



3N 


L being the “length” of the space available. Compare the thermodynamics and the density of states 
of this system with the corresponding quantities obtained in the preceding problem. 

3.17. If we take the function/)^, p) in equation (3.5.3) to be U — H(q,p), then clearly (/) = 0; formally, 
this would mean 


J [U-H(q,p)]e~ tiH ^P ) dw = 0. 


Derive, from this equation, expression (3.6.3) for the mean-square fluctuation in the energy of a 
system embedded in the canonical ensemble. 

3.18. Show that for a system in the canonical ensemble 


<(A£) 3 ) = k 2 


j-'A 



+ 2 T 3 C v 


Verify that for an ideal gas 



2 

3TV 


and 



8 

97V 1 ' 


3.19. Consider the long-time averaged behavior of the quantity dG/dt, where 

G =J2 ( liPi' 

i 

and show that the validity of equation (3.7.5) implies the validity of equation (3.7.6), and vice versa. 

3.20. Show that, for a statistical system in which the interparticle potential energy u(r) is a 
homogeneous function (of degree n) of the particle coordinates, the virial V is given by 


V = -3 PV-nU 
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and, hence, the mean kinetic energy K by 

K = - l -V=\ (3 PV +nU)= —OPP + nE); 

2 2 (n + 2) 

here, U denotes the mean potential energy of the system while E = K+U. Note that this result 
holds not only for a classical system but for a quantum-mechanical one as well. 

3.21. (a) Calculate the time-averaged kinetic energy and potential energy of a one-dimensional 

harmonic oscillator, both classically and quantum-mechanically, and show that the results 
obtained are consistent with the result established in the preceding problem (with n = 2). 

(b) Consider, similarly, the case of the hydrogen atom (n = -1) on the basis of (i) the Bohr- 
Sommerfeld model and (ii) the Schrodinger model. 

(c) Finally, consider the case of a planet moving in (i) a circular orbit or (ii) an elliptic orbit around 
the sun. 

3.22. The restoring force of an anharmonic oscillator is proportional to the cube of the displacement. 
Show that the mean kinetic energy of the oscillator is twice its mean potential energy. 

3.23. Derive the virial equation of state equation (3.7.15) from the classical canonical partition function 
(3.5.5). Show that in the thermodynamic limit the interparticle terms dominate the ones that come 
from interactions of the particles with the walls of the container. 

3.24. Show that in the relativistic case the equipartition theorem takes the form 

(m 0 u 2 (l — u 2 /c 2 ) -1 / 2 ) = 3A.T, 

where m 0 is the rest mass of the particle and u its speed. Check that in the extreme relativistic case 
the mean thermal energy per particle is twice its value in the nonrelativistic case. 

3.25. Develop a kinetic argument to show that in a noninteracting system the average value of the 
quantity J2i PiPi is precisely equal to 3 PV. Hence show that, regardless of relativistic 
considerations, PV = NkT. 

3.26. The energy eigenvalues of an s-dimensional harmonic oscillator can be written as 

ej = (J + s/2)ha>; j= 0,1,2,... 

Show that the;th energy level has a multiplicity (J + s — 1)! /_/'! (s - 1)!. Evaluate the partition 
function, and the major thermodynamic properties, of a system of N such oscillators, and compare 
your results with a corresponding system of sN one-dimensional oscillators. Show, in particular, 
that the chemical potential p s = spi- 

3.27. Obtain an asymptotic expression for the quantity \ng(E) for a system of N quantum-mechanical 
harmonic oscillators by using the inversion formula (3.4.7) and the partition function (3.8.15). 
Hence show that 

S _/ E 1\ / E 1\ EE 1\ / E 1\ 

Nk ~ \Nha) + 2) ln \Nha> + 2/ \Nha> 2j ln \Nhw 2)' 

[Flint: Employ the Darwin-Fowler method.] 

3.28. (a) When a system of N oscillators with total energy E is in thermal equilibrium, what is the 

probability p n that a particular oscillator among them is in the quantum state «? 

[Hint: Use expression (3.8.25).] 

Show that, for AT 3> 1 andf?» n, p„ ~ («)"/(« + 1)" +1 , where n = R/N. 

(b) When an ideal gas of N monatomic molecules with total energy E is in thermal equilibrium, 
show that the probability of a particular molecule having an energy in the neighborhood of e 
is proportional to exp(— f)e), where /3 = 3N/2E. 

[Hint: Use expression (3.5.16) and assume that IV » 1 and /•' t.[ 

3.29. The potential energy of a one-dimensional, anharmonic oscillator may be written as 

V(q) = cq z -gq 3 -fq 4 , 

where c, g, and/ are positive constants; quite generally, g and/ may be assumed to be very small 
in value. Show that the leading contribution of anharmonic terms to the heat capacity of the 
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oscillator, assumed classical, is given by 



5 g^\ 
4 c 3 ) 


T 


and, to the same order, the mean value of the position coordinate q is given by 

3gkT 
4 c 2 


3.30. The energy levels of a quantum-mechanical, one-dimensional, anharmonic oscillator may be 
approximated as 


Sn — 



hco — x 



n = 0,1,2,... 


The parameter x, usually <sc 1, represents the degree of anharmonicity. Show that, to the first order 
in x and the fourth order in u(= hco/kT), the specific heat of a system of N such oscillators is given 
by 


C = Nk 


1- u 2 

12 


240 


-4xl — 
. u 



Note that the correction term here increases with temperature. 

3.31. Study, along the lines of Section 3.8, the statistical mechanics of a system of N “Fermi oscillators," 
which are characterized by only two eigenvalues, namely 0 and e. 

3.32. The quantum states available to a given physical system are (i) a group of gi equally likely states, 
with a common energy ei and (ii) a group of g 2 equally likely states, with a common energy e 2 > £i • 
Show that this entropy of the system is given by 

S= —k[p\ ln(pi/gi) + p 2 ln(p 2 /g 2 )l. 


where pi and p 2 are, respectively, the probabilities of the system being in a state belonging to group 
1 or to group 2: p\ + p 2 = 1. 

(a) Assuming that the p l are given by a canonical distribution, show that 


S = k 


Ingi +ln{l + (g 2 /gi)e x ] + 


x 

l + (gi/g2)e*J’ 


where x = (e 2 — £\)/kT, assumed positive. Compare the special case gi = g 2 = 1 with that of the 
Fermi oscillator of the preceding problem. 

(b) Verify the foregoing expression for S by deriving it from the partition function of the system. 

(c) Check that at T -> 0, S ->■ /clngi. Interpret this result physically. 

3.33. Gadolinium sulphate obeys Langevin’s theory of paramagnetism down to a few degrees Kelvin. Its 
molecular magnetic moment is 7.2 x 10 23 amp-m 2 . Determine the degree of magnetic saturation 
in this salt at a temperature of 2 K in a field of flux density 2 weber/m 2 . 

3.34. Oxygen is a paramagnetic gas obeying Langevin’s theory of paramagnetism. Its susceptibility 
per unit volume, at 293 K and at atmospheric pressure, is 1.80 x 10~ 6 mks units. Determiners 
molecular magnetic moment and compare it with the Bohr magneton (which is very nearly equal 
to 9.27 x 10~ 24 amp-m 2 ). 

3.35. (a) Consider a gaseous system of N noninteracting, diatomic molecules, each having an electric 

dipole moment fi, placed in an external electric field of strength E. The energy of such a 
molecule will be given by the kinetic energy of rotation as well as translation plus the potential 
energy of orientation in the applied field: 


e = 


2m 


Pg . P§ 1 

21 21 sin 2 d J 


— pEcos 9 
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where I is the moment of inertia of the molecule. Study the thermodynamics of this system, 
including the electric polarization and the dielectric constant. Assume that (i) the system is a 
classical one and (ii) \fiE\ <£ kT. 16 

(b) The molecule H 2 O has an electric dipole moment of 1.85 x 10 -18 e.s.u. Calculate, on the basis 
of the preceding theory, the dielectric constant of steam at 100°C and at atmospheric pressure. 

3.36. Consider a pair of electric dipoles fi and /P, oriented in the directions and (()',<]>'), 

respectively; the distance R between their centers is assumed to be fixed. The potential energy in 
this orientation is given by 


—-^j-{2cos0cos6»' — sin$sin0'cos(0 — </>')). 

Now, consider this pair of dipoles to be in thermal equilibrium, their orientations being governed 
by a canonical distribution. Show that the mean force between these dipoles, at high temperatures, 
is given by 


(Mb') 2 R 
kT R 7 ’ 


R being the unit vector in the direction of the line of centers. 

3.37. Evaluate the high-temperature approximation of the partition function of a system of magnetic 
dipoles to show that the Curie constant Cj is given by 


Cj 


Nog 2 ^ —2 
-;—- m l . 


Hence derive the formula (3.9.26). 

3.38. Replacing the sum in (3.9.18) by an integral, evaluate Qi(p) of the given magnetic dipole and study 
the thermodynamics following from it. Compare these results with the ones following from the 
Langevin theory. 

3.39. Atoms of silver vapor, each having a magnetic moment fi B (g = 2,J = g), align themselves either 
parallel or antiparallel to the direction of an applied magnetic field. Determine the respective 
fractions of atoms aligned parallel and antiparallel to a field of flux density 0.1 weber/m 2 at a 
temperature of 1,000 K. 

3.40. (a) Show that, for any magnetizable material, the heat capacities at constant field H and at 

constant magnetization M are connected by the relation 


Ch~Cm = -T 




(b) Show that for a paramagnetic material obeying Curie's law 

C h -Cm = CH 2 /T 2 , 


where C on the right side of this equation denotes the Curie constant of the given sample. 

3.41. A system of N spins at a negative temperature (E > 0) is brought into contact with an ideal-gas 
thermometer consisting of N' molecules. What will the nature of their state of mutual equilibrium 
be? Will their common temperature be negative or positive, and in what manner will it be affected 
by the ratio Af'/AT? 

3.42. Consider the system of N magnetic dipoles, studied in Section 3.10, in the microcanonical 
ensemble. Enumerate the number of microstates, Q(N,E), accessible to the system at energy!? 
and evaluate the quantities S(N,E) and T(N,E). Compare your results with equations (3.10.8) 
and (3.10.9). 


16 The electric dipole moments of molecules are generally of order 10 18 e.s.u. (or a Debye unit). In a field of 1 e.s.u. 
(= 300volts/cm) and at a temperature of 300 K, the parameter PfiE = O(10 -4 ). 
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3.43. Consider a system of charged particles (not dipoles), obeying classical mechanics and classical 
statistics. Show that the magnetic susceptibility of this system is identically zero (Bohr-van 
Leeuwen theorem). 

[Note that the Hamiltonian of this system in the presence of a magnetic field H(= V x A) will be a 
function of the quantities + ( ej/c)A(rj ), and not of the pj as such. One has now to show that the 
partition function of the system is independent of the applied field.] 

3.44. The expression (3.3.13) for the entropy S is equivalent to Shannon’s (1949) definition of the 
information contained in a message I = — P r ln(P r ), where P, represents the probability of 
message r. 

(a) Show that information is maximized if the probabilities of all messages are the same. Any other 
distribution of probabilities reduces the information. In English, “e” is more common than “z”, 
so P e > P„ so the information per character in an English message is less than the optimal 
amount possible based on the number of different characters used in an English text. 

(b) The information in a text is also affected by correlations between characters in the text. For 
example, in English, “q” is always followed by “u”, so this pair of characters contains the same 
information as “q” alone. The probability of a character indexed by r followed immediately 
by character indexed by r’ is = P r P,i G r y, where G r y is the character-pair correlation 
function. If pairs of characters are uncorrelated, then G r , r ' = 1. Show that if characters are 
uncorrelated then the information in a two-character message is twice the information of a 
single-character message and that correlations (G,,,-' ^ 1) reduce the information content. 
[Hint: Use the inequality lnx < x — 1.] 

(c) Write a computer program to determine the information per character in a text file by 
determining the single-character probabilities P r and character-pair correlations G r y ■ 
Computers usually use one full byte per character to store information. Since one byte can 
store 256 different messages, the potential information per byte is ln256 = 81n2 = 8bits. Show 
that the information per character in your text file is considerably less than 8 bits and explain 
why it is possible for file-compression algorithms to reduce the size of a computer file without 
sacrificing any of the information contained in the file. 



The Grand Canonical Ensemble 


In the preceding chapter we developed the formalism of the canonical ensemble and 
established a scheme of operations for deriving the various thermodynamic properties of a 
given physical system. The effectiveness of that approach became clear from the examples 
discussed there; it will become even more vivid in the subsequent studies carried out in 
this text. However, for a number of problems, both physical and chemical, the usefulness 
of the canonical ensemble formalism turns out to be rather limited and it appears that a 
further generalization of this formalism is called for. The motivation that brings about this 
generalization is physically of the same nature as the one that led us from the microcanoni- 
cal to the canonical ensemble — it is just the next natural step from there. It comes from the 
realization that not only the energy of a system but the number of particles as well is hardly 
ever measured in a “direct” manner; we only estimate it through an indirect probing into 
the system. Conceptually, therefore, we may regard both N and E as variables and identify 
their expectation values, (AT) and (E ), with the corresponding thermodynamic quantities. 

The procedure for studying the statistics of the variables N and E is self-evident. We 
may either (i) consider the given system A as immersed in a large reservoir A' with which it 
can exchange both energy and particles or (ii) regard it as a member of what we may call 
a grand canonical ensemble, which consists of the given system A and a large number of 
(mental) copies thereof, the members of the ensemble carrying out a mutual exchange of 
both energy and particles. The end results, in either case, are asymptotically the same. 

4.1 Equilibrium between a system 
and a particle-energy reservoir 

We consider the given system A as immersed in a large reservoir A', with which it can 
exchange both energy and particles; see Figure 4.1. After some time has elapsed, the system 
and the reservoir are supposed to attain a state of mutual equilibrium. Then, according 
to Section 1.3, the system and the reservoir will have a common temperature T and a 
common chemical potential /i. The fraction of the total number of particles N i()> and the 
fraction of the total energy £ (0) that the system A can have at any time t are, however, 
variables (whose values, in principle, can lie anywhere between zero and unity). If, at a 
particular instant of time, the system A happens to be in one of its states characterized by 
the number N r of particles and the amount E s of energy, then the number of particles in 
the reservoir would be N' r and its energy E' s , such that 

N r + N' r = AT (0) = const. (1) 
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FIGURE 4.1 A statistical system immersed in a particle-energy reservoir. 


and 


E s + E' s = E (0) — const. 


( 2 ) 


Again, since the reservoir is supposed to be much larger than the given system, the values 
of N r and E s that are going to be of practical importance will be very small fractions of the 
total magnitudes Af (0) and E (0) , respectively; therefore, for all practical purposes, 1 


N r 

JV<°> 



(3) 


and 


Es_ 
£t °> 



(4) 


Now, in the manner of Section 3.1, the probability P r>s that, at any time t, the sys¬ 
tem A is found to be in an (N r ,E s )-state would be directly proportional to the number 
of microstates Q'(N' n E' s ) that the reservoir can have for the corresponding macrostate 
(N'.,E' S ). Thus, 


P r ,s OC £2'(N {0) - N r ,E (0) - Eg). 
Again, in view of (3) and (4), we can write 


(5) 


\nQ,’(N m - N r ,E (0) - E s ) = lnQ'(N {0 \E {0) ) 


\ 3W- V ) E ,, E m 

~ In &(N m ,E (0) ) + ^N r - r^Eg) 
kv kr 


{—E s ) + ■ ■ ■ 


( 6 ) 


see equations (1.2.3), (1.2.7), (1.3.3), and (1.3.5). Here, p! and V are, respectively, the 
chemical potential and the temperature of the reservoir (and hence of the given system 

'Note that A here could well be a relatively small “part” of a given system j 4 (0) , whiled' represents the “rest” of A 0 '. 
That would give a truly practical perspective to the grand canonical formalism. 
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as well). From (5) and (6), we obtain the desired result: 

P r ,s OC exp (~ceN r - I3E S ), (7) 


where 


a = —n/kT, /S = 1 /kT. 


( 8 ) 


On normalization, it becomes 


p„. «p . (9) 

£exp (-ctNr-pEs) 
r,s 

the summation in the denominator goes over all the (N r ,E s )-states accessible to the 
system A. Note that our final expression for P lyS is independent of the choice of the 
reservoir. 

We shall now examine the same problem from the ensemble point of view. 


4.2 A system in the grand canonical ensemble 

We now visualize an ensemble of W identical systems (which, of course, can be labeled as 
1,2,..., W) mutually sharing a total number of particles 2 W N and a total energy W E. Let 
n r>s denote the number of systems that have, at any time t, the number N r of particles and 
the amount^ of energy (r,s = 0,1,2,...); then, obviously, 

^ra r , s = W, (la) 

r,s 

J2n r , s N r = ^rN, (lb) 

r,s 

and 


'^ j n r:S E s = 'N'E. (lc) 

r,s 

Any set {n r ,s}> of the numbers n ryS , which satisfies the restrictive conditions (1), represents 
one of the possible modes of distribution of particles and energy among the members 
of our ensemble. Furthermore, any such mode of distribution can be realized in W{n ryS ) 
different ways, where 


W{n r , s } 


W! 

r,s 


( 2 ) 


2 For simplicity, we shall henceforth use the symbols N and E instead of ( N) and (£). 
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We may now define the most probable mode of distribution, {n* s }, as the one that 
maximizes expression (2), satisfying at the same time the restrictive conditions (1). Going 
through the conventional derivation, see Section 3.2, we obtain for a large ensemble 

K,s _ exp (—aN r — pE s ) 

W E ex P (~aN r - PE s ) ’ 

r,s 

compare to the corresponding equation (3.2.10) for the canonical ensemble. Alternatively, 
we may define the expectation (or mean) values of the numbers n r , s , namely 


(n r , s ) 


y n r ,s^V{rir,s} 

f«r,s) 

E' W{n r ,s} 

{ n r ,s ) 


(4) 


where the primed summations go over all distribution sets that conform to conditions (1). 
An asymptotic expression for (n r>s ) can be derived by using the method of Darwin and 
Fowler — the only difference from the corresponding derivation in Section 3.2 being that, 
in the present case, we will have to work with functions of more than one (complex) 
variable. The derivation, however, runs along similar lines, with the result 

Lim l ' nr,s) ~ n *- s - exp {-aNr - PE s ) 

W — e/V E ex P (—aN r — pE s )’ 

r,s 


in agreement with equation (4.1.9). The parameters a and p, so far undetermined, are 
eventually determined by the equations 


E N r exp (~aN r ~ PE S ) 

r,s 

E exp (~aN r - PE S ) 

r,s 


3 

da 


In exp (—aN r - pE s ) 


r,s 


( 6 ) 


and 


E Es exp (—aN r — pE s ) 

E = Li _ 

E exp (~aN r - PE S ) 

r,s 

where the quantities N and E here are supposed to be preassigned. 


^ | ln Y eX P (-aN r - PE S ) 


( 7 ) 
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4.3 Physical significance of the various 
statistical quantities 

To establish a connection between the statistics of the grand canonical ensemble and the 
thermodynamics of the system under study, we introduce a quantity q, defined by 

^ = In 1 ex P ~ P Es) | : (1) 

the quantity q is a function of the parameters a and p, and also of all the E s . 3 Taking the 
differential of q and making use of equations (4.2.5), (4.2.6), and (4.2.7), we get 

dq = —Nda — Edp — y^(w r ,s) dE s , ( 2 ) 


so that 


d(cf + otN + pE) = p 


1-dN+dE- 

V 


1 


y , (Mr,s) dE s 
r,s 


( 3 ) 


To interpret the terms appearing on the right side of this equation, we compare the 
expression enclosed within the parentheses with the statement of the first law of thermo¬ 
dynamics, that is, 


8Q = dE + 8W — fidN, ( 4 ) 

where the various symbols have their usual meanings. The following correspondence now 
seems inevitable: 

sw =-^E, {nr - s)dEs ’ E = -oc/p, ( 5 ) 


with the result that 


d(q + aN+pE) = p8Q. (6) 

The parameter /3, being the integrating factor for the heat SQ, must be equivalent to the 
reciprocal of the absolute temperature T, so we may write 

P = 1 /kT ( 7 ) 


and, hence, 


a = —/ i/kT. 


( 8 ) 


3 This quantity was first introduced by Kramers, who called it the q-potential. 
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The quantity (q + aN + fiE) would then be identified with the thermodynamic variable 
S/k; accordingly, 



aN — /3E = 


TS + fiN-E 
kf 


(9) 


However, / iN is identically equal to G, the Gibbs free energy of the system, and hence to 
(E-TS+PV). So, finally, 

^ = ln |Xl ex P (_ “ iVr_ ' s£s) | = ^F- GO) 

Equation (10) provides the essential link between the thermodynamics of the given sys¬ 
tem and the statistics of the corresponding grand canonical ensemble. It is, therefore, a 
relationship of central importance in the formalism developed in this chapter. 

To derive further results, we prefer to introduce a parameter z, defined by the relation 

z = e~ a = (11) 


the parameter z is generally referred to as the fugacity of the system. In terms of z, the 
q-potential takes the form 


q = In 



= In 


OO 

J2 z Nr Q Nr (V,T) 

N r =0 


(with Qo = 1), 


( 12 ) 

(13) 


so we may write 


q(z, V, T) = ln<9(z, V, T), 


(14) 


where 


fl(z, V, T) = z Nr Q Nr (V, T) (with Qo = 1). (15) 

N r = 0 

Note that, in going from expression (12) to (13), we have (mentally) carried out a sum¬ 
mation over the energy values E s , with N r fixed, thus giving rise to the partition function 
Q Nr (Y, T ); of course, the dependence of Qjv,. on V comes from the dependence of the E s 
on V. In going from (13) to (14), we have (again mentally) carried out a summation over all 
the numbers N r = 0,1,2, ■ ■ ■, oo, thus giving rise to the grand partition function Q(z, V, T) 
of the system. The ^-potential, which we have already identified with PV/kT, is, therefore, 
the logarithm of the grand partition function. 
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It appears that in order to evaluate the grand partition function Q(z, V, T) we have to 
go through the routine of evaluating the partition function Q(N, V, T). In principle, this is 
indeed true. In practice, however, we find that on many occasions an explicit evaluation 
of the partition function is extremely hard while considerable progress can be made in 
the evaluation of the grand partition function. This is particularly true when we deal with 
systems in which the influence of quantum statistics and/or interparticle interactions is 
important; see Sections 6.2 and 10.1. The formalism of the grand canonical ensemble then 
proves to be of considerable value. 

We are now in a position to write down the full recipe for deriving the leading ther¬ 
modynamic quantities of a given system from its q-potential. We have, first of all, for the 
pressure of the system 


P(z, V, T) = —g(z, W T) = — In Q.(z, V, T). (16) 

Next, writing N for N and U for E, we obtain with the help of equations (4.2.6), (4.2.7), 
and (11) 



' 3 


' 3 

N(z, V,T) = z 

—q(z, V, T) 
_az 

II 

—q(li,V, T) 
_3/i 


(17) 


and 


U(z, V, T) = - 


—q(z, V, T) 
dp 


= kT z 


z,V 


—q(z, V, T) 


z,V 


(18) 


Eliminating z between equations (16) and (17), one obtains the equation of state, that is, 
the (P, V, T) -relationship, of the system. On the other hand, eliminating z between equa¬ 
tions (17) and (18), one obtains U as a function of N, V, and T, which readily leads to the 
specific heat at constant volume as ( dU/dT)N,v■ The Helmholtz free energy is given by the 
formula 


A = N/x — PV = NkTlnz - kT\n&(z, V, T) 


= —fcrin 


<2(z, V, T) 


(19) 


which may be compared with the canonical ensemble formula A — — kT In Q(N, V, T); see 
also Problem 4.2. Finally, we have for the entropy of the system 


S = 


U-A 



T 


z,V 


Nk\nz+ kq. 


( 20 ) 
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4.4 Examples 

We shall now study a couple of simple problems, with the explicit purpose of demonstrat¬ 
ing how the method of the ^-potential works. This is not intended to be a demonstration of 
the power of this method, for we shall consider here only those problems that can be solved 
equally well by the methods of the preceding chapters. The real power of the new method 
will become apparent only when we study problems involving quantum-statistical effects 
and effects arising from interparticle interactions; many such problems will appear in the 
remainder of the text. 

The first problem we propose to consider here is that of the classical ideal gas. In 
Section 3.5 we showed that the partition function Qjv( V, T ) of this system could be 
written as 


Qn(V,T) = 


[Qi(k,r)] w 

N\ 


( 1 ) 


where Qi(V, T) may be regarded as the partition function of a single particle in the sys¬ 
tem. First of all, we should note that equation (1) does not imply any restrictions on 
the particles having internal degrees of motion; those degrees of motion, if present, 
would affect the results only through Q\. Second, we should recall that the factor N\ 
in the denominator arises from the fact that the particles constituting the gas are, in 
fact, indistinguishable. Closely related to the indistinguishability of the particles is the 
fact that they are nonlocalized, for otherwise we could distinguish them through their 
very sites; compare, for instance, the system of harmonic oscillators, which was studied 
in Section 3.8. Now, since our particles are nonlocalized they can be anywhere in the 
space available to them; consequently, the function Qi will be directly proportional 
to V: 


Q 1 (V,T) = Vf(T), (2) 

where/(T) is a function of temperature alone. We thus obtain for the grand partition 
function of the gas 


Cl(z, V,T)=J2 zNr QN r (V, T)=jr {zV f^ ]N ' 

N r =0 N r =0 r ' 


= exp{zVf(T)}, 


(3) 


which gives 


q(z,V,T)=zVf(T). 


( 4 ) 
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Formula (4.3.16) through (4.3.20) then lead to the following results: 


P = zkTf(T), 

(5) 

N = zVf(T), 

(6) 

U = zVkT 2 f (T), 

(7) 

A = NkTlnz - zVkTf(T), 

(8) 


and 


S=-Nklnz + zVk{Tf(T)+f(T)}. (9) 

Eliminating z between (5) and (6), we obtain the equation of state of the system: 


PV = NkT. (10) 

We note that equation (10) holds irrespective of the form of the function f(T). Next, 
eliminating z between (6) and (7), we obtain 

U = NkT 2 f(T)/f(T), (11) 


which gives 


C v = Nk 


2Tf(T)f\T) + T 2 {f{T)f"(T) - \f'(T)] 2 } 
\f(T) ]2 


( 12 ) 


In simple cases, the function/(T) turns out to be directly proportional to a certain 
power of T. Supposing that/(T) cx T n , equations (11) and (12) become 


U = n(NkT) 


(11a) 


and 


CV = n(Nk). (12a) 

Accordingly, the pressure in such cases is directly proportional to the energy density of the 
gas, the constant of proportionality being 1 /n. The reader will recall that the case n = 3/2 
corresponds to a nonrelativistic gas while n = 3 corresponds to an extreme relativistic one. 

Finally, eliminating z between equation (6) and equations (8) and (9), we obtain A 
and S as functions of N, V, and T. This essentially completes our study of the classical 
ideal gas. 

The next problem to be considered here is that of a system of independent, localized 
particles — a model which, in some respects, approximates a solid. Mathematically, the 
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problem is similar to that of a system of harmonic oscillators. In either case, the micro¬ 
scopic entities constituting the system are mutually distinguishable. The partition function 
Qn(V, T) of such a system can be written as 

Qn(V, T)= [Qi(V,T)] n . (13) 

At the same time, in view of the localized nature of the particles, the single-particle par¬ 
tition function Qi ( V, T) is essentially independent of the volume occupied by the system. 
Consequently, we may write 


Qi(V,T) = 4(T), (14) 

where 4>{T) is a function of temperature alone. We then obtain for the grand partition 
function of the system 


OO 

<2(z, V,T) = \z<KT)\ Nt = {\-zMT)T l \ 

N r =0 


(15) 


clearly, the quantity z</>(D must stay below unity, so that the summation over N r is 
convergent. 

The thermodynamics of the system follows straightforwardly from equation (15). We 
have, to begin with, 

kT kT 

P=— q(z,T) = ~ — ln{l-z0(r)}. (16) 


Since both z and T are intensive variables, the right side of (16) vanishes as V -»• oo. Hence, 
in the thermodynamic limit, P = 0. 4 For other quantities of interest, we obtain, with the 
help of equations (4.3.17) through (4.3.20), 


N — Z * {T) 

(17) 

1 -z<t>(T)’ 

jj zkT 2 <t>'(T) 

(18) 

1 ~z4>{T) ’ 

A = Nfcrinz+fcrinjl — z<j>{T)}, 

(19) 


and 


From (17), we get 


S = —Nkinz- fcln{l - ztf>(T)} + 

1 -z4>(T) 


z<p(T) = 


N 

N+ 1 



(IV » 1). 


( 20 ) 


( 21 ) 


It will be seen in the sequel that P actually vanishes like (In N)/N. 
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It follows that 



( 22 ) 


Equations (17) through (20) now give 


U/N = kT 2 <t>'(T)/4>(T), 
A/N = —kTh\4>(T) + O 



(18a) 


(19a) 


and 


S/Nk = In (HT 1 ) + T<t>'(T)l4>(T) + O 



(20a) 


Substituting 


(p(T) = [2sinh(fi,to/2fc7’)]~ 1 


(23) 


into these formulae, we obtain results pertaining to a system of quantum-mechanical, 
one-dimensional harmonic oscillators. The substitution 


^(T) = kT/ha>, 


(24) 


on the other hand, leads to results pertaining to a system of classical, one-dimensional 
harmonic oscillators. 

As a corollary, we examine here the problem of solid-vapor equilibrium. Consider a 
single-component system, having two phases — solid and vapor — in equilibrium, con¬ 
tained in a closed vessel of volume V at temperature T. Since the phases are free to 
exchange particles, a state of mutual equilibrium would imply that their chemical poten¬ 
tials are equal; this, in turn, means that they have a common fugacity as well. Now, the 
fugacity z g of the gaseous phase is given by, see equation (6), 


where N g is the number of particles in the gaseous phase and V g the volume occupied by 
them; in a typical case, V g — V. The fugacity z s of the solid phase, on the other hand, is 
given by equation (21): 


1 



(26) 


Equating (25) and (26), we obtain for the equilibrium particle density in the vapor phase 

N g /V g =f(T)/<P(T). (27) 





102 Chapter 4 • The Grand Canonical Ensemble 


Now, if the density in the vapor phase is sufficiently low and the temperature of the system 
sufficiently high, the vapor pressure P would be given by 

N f(T) 

P v anor = — kT = kT —-. (28) 

p Vg 

To be specific, we may assume the vapor to be monatomic; the function/(T) is then of 
the form 


f(T) = (2nmkT) 3/2 /h 3 . (29) 

On the other hand, if the solid phase can be approximated by a set of three-dimensional 
harmonic oscillators characterized by a single frequency w (the Einstein model), the 
function <j>(T) would be 


<p(T) = [2smh(hco/2kT)]~ 3 . (30) 

However, there is one important difference here. An atom in a solid is energetically more 
stabilized than an atom that is free — that is why a certain threshold energy is required to 
transform a solid into separate atoms. Let e denote the value of this energy per atom, which 
in a way implies that the zeros of the energy spectra e g and e s , which led to the functions 
(29) and (30), respectively, are displaced with respect to one another by an amount e. A true 
comparison between the functions/(D and <p(T) must take this into account. As a result, 
we obtain for the vapor pressure 

P vap0I = kT( 2jT ™ kT ^ ' [2sinh(hoi/2kT)] 3 e~ s i kT . (31) 

In passing, we note that equation (27) also gives us the necessary condition for the 
formation of the solid phase. The condition clearly is: 

f( D 

N > V— —(32) 
4>(T)’ 

where N is the total number of particles in the system. Alternatively, this means that 

T < T c , (33) 


where T c is a characteristic temperature determined by the implicit relationship 

/(T c ) _ N 
<p(T c ) V 


(34) 


Once the two phases appear, the number N g (T ) will have a value determined by equa¬ 
tion (27) while the remainder, N — N g , will constitute the solid phase. 
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4.5 Density and energy fluctuations in the grand 
canonical ensemble: correspondence 
with other ensembles 

In a grand canonical ensemble, the variables N and E, for any member of the ensemble, 
can lie anywhere between zero and infinity. Therefore, on the face of it, the grand canoni¬ 
cal ensemble appears to be very different from its predecessors — the canonical and the 
microcanonical ensembles. However, as far as thermodynamics is concerned, the results 
obtained from this ensemble turn out to be identical to the ones obtained from the other 
two. Thus, in spite of strong facial differences, the overall behavior of a given physical sys¬ 
tem is practically the same whether it belongs to one kind of ensemble or another. The 
basic reason for this is that the “relative fluctuations’’ in the values of the quantities that 
vary from member to member in an ensemble are practically negligible. Therefore, in spite 
of the different surroundings that different ensembles provide to a given physical system, 
the overall behavior of the system is not significantly affected. 

To appreciate this point, we shall evaluate the relative fluctuations in the particle den¬ 
sity n and the energy E of a given physical system in the grand canonical ensemble. 
Recalling that 


N = 


Y^Nre- 01 ^-^ 

r,s 

Y e - aN r-P E s * 
r,s 


(i) 


it readily follows that 


=_^2+]V 2 . 

aa K,B S 


( 2 ) 


Thus 




dN 


dN 


(3) 


From (3), we obtain for the relative mean-square fluctuation in the particle density 
n (= N/V) 


(An) 2 (AN) 2 kT / dN 


n 2 


N 2 N 2 \ dll 


(4) 


T,V 


In terms of the variable v (= V/N), we may write 


(An) 2 kTv 2 /d(V/v)\ kT (dv\ 

n 2 V 2 \ 3 li ) t,v v \3 li )t 


( 5 ) 
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To put this result into a more practical form, we recall the thermodynamic relation 


dp = vdP — sdT, 


( 6 ) 


according to which dp. (at constant T) = vdP. Equation (5) then takes the form 


(An) 2 


n 


2 


kT 1 | 

tdv\ 

i kT 

~Vv\ 

K dP; 

) t =^ KT ’ 


(7) 


where kj is the isothermal compressibility of the system. 

Thus, the relative root-mean-square fluctuation in the particle density of the given sys¬ 
tem is ordinarily 0(N~ l l 2 ) and, hence, negligible. However, there are exceptions, like the 
ones met with in situations accompanying phase transitions. In those situations, the com¬ 
pressibility of a given system can become excessively large, as is evidenced by an almost 
“flattening” of the isotherms. For instance, at a critical point the compressibility diverges, 
so it is no longer intensive. Finite-size scaling theory described in Chapters 12 and 14 indi¬ 
cates that at the critical point the isothermal compressibility scales with system size as 
k t {T c ) ~ N y i dv where y and v are certain critical exponents and d is the dimension. For 
the case of experimental liquid-vapor critical points, k t (T c ) ~ /V 0,63 . Accordingly, the root- 
mean-square density fluctuations grow faster than /V 1/2 — in this case, like /V 0 - 82 . Thus, 
in the region of phase transitions, especially at the critical points, we encounter unusu¬ 
ally large fluctuations in the particle density of the system. Such fluctuations indeed exist 
and account for phenomena like critical opalescence. It is clear that under these circum¬ 
stances the formalism of the grand canonical ensemble could, in principle, lead to results 
that are not necessarily identical to the ones following from the corresponding canonical 
ensemble. In such cases, it is the formalism of the grand canonical ensemble that will have 
to be preferred because only this one will provide a correct picture of the actual physical 
situation. 

We shall now examine fluctuations in the energy of the system. Following the usual 
procedure, we obtain 


(AH ) 2 = E 2 - E 2 = - 


= kT 2 


z,v 


dU' 


3 T) z ,, 


( 8 ) 


To put expression (8) into a more comprehensible form, we write 


dU 

dT 


Z,V 


dU\ 

dT ) N,\ 


dU 

m 


T,V 


dN 

dT 


) z,V 


(9) 


where the symbol N is being used interchangeably for N. Now, in view of the fact that 


N = -(-?- In Q) , U=-(^~ In a 


da 


h.v 


3/S 


GO) 


a,V 
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we have 



) a,V 



dU\ 

) p,v 


( 11 ) 


and, hence, 



( 12 ) 


Substituting expressions (9) and (12) into equation (8) and remembering that the quantity 
(dU/dT)tty is the familiar CV, we get 



(13) 


Invoking equations (3.6.3) and (3), we finally obtain 



(14) 


Formula (14) is highly instructive; it tells us that the mean-square fluctuation in the 
energy E of a system in the grand canonical ensemble is equal to the value it would 
have in the canonical ensemble plus a contribution arising from the fact that now the 
particle number N is also fluctuating. Again, under ordinary circumstances, the relative 
root-mean-square fluctuation in the energy density of the system would be practically 
negligible. However, in the region of phase transitions, unusually large fluctuations in the 
value of this variable can arise by virtue of the second term in the formula. 


4.6 Thermodynamic phase diagrams 


One of the great successes of thermodynamics and statistical mechanics over the last 150 
years has been in the study of phase transitions. Statistical mechanics provides the basis 
for accurate models for a wide variety of thermodynamic phases of materials and has led 
to a detailed understanding of phase transitions and critical phenomena. 

Condensed materials exist in a variety of phases that depend on thermodynamic 
parameters such as temperature, pressure, magnetic field, and so on. Thermodynamics 
and statistical mechanics can be used to determine the properties of individual phases, 
and the locations and characteristics of the phase transitions that occur between those 
phases. Thermodynamic phases are regions in the phase diagram where the thermody¬ 
namics properties are analytic functions of the thermodynamic parameters, while phase 
transitions are points, lines, or surfaces in the phase diagram where the thermodynamic 
properties are nonanalytic. Much of the remainder of this text is devoted to using statistical 
mechanics to explain the properties of material phases and phase transitions. 
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FIGURE 4.2 Sketches (not-to-scale) of the P-T (a) and P-V (b) phase diagrams for argon. This geometry is generic 
for a wide range of materials. The letters S, L, and V denote solid, liquid, and vapor phases. 


It is instructive to examine the structure of phase diagrams. Argon provides a good 
example because the structure of its phase diagram is similar to that of many other mate¬ 
rials (see Figure 4.2). At moderate temperatures and pressures, the stable thermodynamic 
phases of argon are solid, liquid, and vapor. At high temperatures there is a supercritical 
fluid phase that smoothly connects the liquid and vapor phases. Most materials, includ¬ 
ing argon, exhibit multiple solid phases especially at high pressures and low temperatures. 
Figure 4.2(a) is the phase diagram in the P-T plane and shows the solid-liquid coexis¬ 
tence line, the liquid-vapor coexistence line, and the solid-vapor coexistence line. The 
three lines meet at the triple point ( T t ,P t ) and the liquid-vapor coexistence line ends at 
the critical point ( T C ,P C ). The triple point values and critical point values for argon are 
T t — 83.8 K, P t = 68.9 kPa, T c = 150.7 K, and P c = 4.86 MPa, respectively. 

Figure 4.2(b) is the phase diagram in the P-V plane and shows the pressure versus the 
specific volume v (= V / N ) on the coexistence lines. The dashed lines indicate the triple 
point pressure and critical pressure in both figures. The horizontal tie lines are the por¬ 
tions of isotherms as they cross coexistence lines and show the discontinuities of v. The tie 
lines in order from bottom to top are: sublimation tie lines connecting the solid and vapor 
phases, the triple point tie line that connects all three phases, and a series of solid-liquid 
and liquid-vapor tie lines. Notice that the liquid and vapor specific volumes continuously 
approach each other and are both equal to the critical specific volume v c at the critical 
point. 

The properties of the vapor, liquid, and solid phases are: 

• The vapor phase is a low-density gas that is accurately described by the ideal-gas 
equation of state P — nkT with corrections that are described by the virial expansion; 
see Chapters 6 and 10. 
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• The liquid phase is a dense fluid with strong interactions between the atoms. The 
fluid exhibits characteristic short-range pair correlations and scattering structure, 
as discussed in Section 10.7. The structure factor and the pair correlation function 
for argon, as determined from neutron scattering, are shown in Figure 10.8. For 
temperatures above the critical temperature T c , one cannot distinguish between liquid 
and vapor. The density in this supercritical phase is a smooth function of temperature 
and pressure from the low-density vapor to the high-density liquid. Virial expansions 
developed in Sections 10.1 through 10.3 aptly describe the supercritical region. Strictly 
speaking, one can only distinguish between the liquid and vapor phases on the 
liquid-vapor coexistence line since it is possible to evolve smoothly from one phase 

to the other without crossing a phase boundary. 

• The solid phase is a face-centered cubic crystal structure with long-range order, so the 
scattering structure factor displays Bragg peaks as described in Section 10.7.B. The 
thermodynamic properties of solid phases are described in Section 7.3. 

All equilibrium thermodynamic properties within a single phase are analytic func¬ 
tions of the thermodynamic parameters while phase transitions are defined as places in 
the phase diagram where equilibrium thermodynamic properties are not analytic. Coexis¬ 
tence lines, or first-order phase transition lines, separate different phases in the P-T phase 
diagram as shown in Figure 4.2(a). Thermodynamic densities are discontinuous across 
coexistence lines. This is displayed on the P-V phase diagram in Figure 4.2(b) by hori¬ 
zontal tie lines that connect different values the specific volume takes in the two phases. 
Generally, all densities such as the specific volume v — V/N, entropy per particle .s' = S/N, 
internal energy density u— U/V, and so on, are discontinuous across first-order phase 
transition lines. The slopes of the coexistence lines in the P-V phase diagram depend 
on the latent heat of the transition and the specific volumes of the coexisting phases; see 
Section 4.7. All three phases coexist at the triple point. 

The liquid-vapor coexistence line extends from the triple point to the critical point at 
the end of the first-order phase transition line. The specific volume is discontinuous on the 
liquid-vapor coexistence line but the size of the discontinuty vanishes at the critical point 
where the specific volume is v c ; see Figure 4.2(b). All densities are continuous functions 
of T and P through the critical point. For this reason, critical points are called continuous 
transitions or, sometimes, second-order phase transitions. Even though thermodynamic 
densities are continuous, the thermodynamic behavior at the critical point is nonanalytic 
since, for example, the specific heat and isothermal compressibility both diverge at the 
critical point. Another characteristic property of critical points is the divergence of the cor¬ 
relation length, which results in a universal behavior of critical points for broad classes of 
materials. The theory of critical points is developed in Chapters 12, 13, and 14. 

Classical statistical mechanics provides a framework for understanding the phase dia¬ 
grams and thermodynamic properties of a wide variety of materials. However, quantum 
mechanics and quantum statistics play an important role at low temperatures when the 
size of the thermal deBroglie wavelength /, = h/\/2jtmkT is of the same order as the 
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T 


FIGURE 4.3 Sketch of the P-T phase diagram for helium-4. The letters S, L, and V denote solid, liquid, and vapor 
phases. The critical point is T c = 5.19K and P c = 227kPa = 2.24atm. The solid-liquid coexistence curve starts at 
P s = 2.5MPa = 25atm at T = OK and does not intersect the liquid-vapor coexistence curve. The A-line is the 
continuous phase transition between the normal liquid and the superfluid phase. The superfluid phase transition 
temperature at the liquid-vapor coexistence line is T; = 2.18K. 


average distance between molecules. This is the case with liquid helium at temperatures 
below a few degrees kelvin. The phase diagram of helium-4 is shown in Figure 4.3. Some 
aspects of the phase diagram are similar to the phase diagram of argon. Both helium 
and argon have liquid-vapor coexistence lines that end in critical points and both have 
crystalline solid phases at low temperatures. 

Three differences between the two phase diagrams are most notable: the solid phase 
for helium only exists for pressures greater than P s = 2.5 GPa = 25 atm, the liquid phase of 
helium extends all the way to zero temperature, and helium-4 exhibits a superfluid phase 
below T x = 2.18K. The superfluid phase exhibits remarkable properties: zero viscosity, 
quantized flow, propagating heat modes, and macroscopic quantum coherence. This 
extraordinary behavior is due to the Bose-Einstein statistics of 4 He atoms and a Bose- 
Einstein condensation into a macroscopic quantum state as discussed in Sections 7.1 and 
11.2 through 11.6. Even the solid phase of helium-4 shows evidence of a macroscopic 
quantum state with the observation of a “supersolid” phase by Kim and Chan (2004). 

By contrast, 3 He atoms obey Fermi-Dirac statistics and display very different behav¬ 
iors from 4 He atoms at low temperatures. The geometry of the phase diagram of helium-3 
is similar to that of helium-4 except that the critical temperature is lower (T c = 3.35K 
compared to 5.19K) and the solid phase forms at 30 atm of pressure rather than 25 atm. 
The dramatic difference is the lack of a superfluid phase near 1K in helium-3. Elelium- 
3 remains a normal liquid all the way down to about 10 millikelvin. The properties of the 
normal liquid phase of helium-3 are described by the theory of degenerate Fermi gases and 
the Fermi liquid theory developed in Chapter 8 and Sections 11.7 and 11.8. The superfluid 
state that forms at millikelvin temperatures is the result of Bardeen, Cooper, and Schrieffer 
(BCS) p-wave pairing between atoms near the Fermi surface; this pairing is discussed in 
Section 11.9. 
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4.7 Phase equilibrium and the Clausius-Clapeyron 
equation 

The thermodynamic properties of the phases of a material determine the geometry of the 
phase diagram. In particular, the Gibbs free energy 

G(N,P,T) = U — TS + PV =A + PV — fi(P t T)N (1) 

determines the locations of the phase boundaries. Note that the chemical potential is the 
Gibbs free energy per molecule; see Problem 4.6 and Appendix H. Consider a cylinder con¬ 
taining N molecules held at constant pressure P and constant temperature T, that is, in an 
isothermal, isobaric assembly. Suppose the cylinder initially contains two phases: vapor 
(A) and liquid (B) so that the total number of molecules is N = Na + Nr and the Gibbs 
free energy is G = Ga(Na,P, T) + Gp,(AV,, P, T ). If the two phases do not coexist at this pres¬ 
sure and temperature, the numbers of molecules in each phase will change as the system 
approaches equilibrium. As the number of molecules in each phase changes, the Gibbs 
free energy changes by an amount 

dG=(—dN/\ + (——^ dNft = (ha ~ MbI^A/a, (2) 

\oJMaJ T,P \oA/b / t,P 

where cINa is the change in the number of molecules in phase A. 

The Gibbs free energy is minimized at equilibrium, so dG < 0. If /m > /m, the number 
of molecules in phase B will increase and the number in phase A will decrease as the sys¬ 
tem approaches equilibrium. If /x a< the number of molecules in phase A will increase 
and the number in phase B will decrease. If the chemical potentials are equal, the Gibbs 
free energy is independent of the number of molecules in the two phases. Therefore, the 
chemical potentials are equal at coexistence: 


Ma — Mb- (3) 

Let’s consider the familiar example of water. At normal pressures and temperatures, 
water has three phases: liquid water, solid ice, and water vapor, and its P-T phase diagram 
is similar to that shown for argon in Figure 4.2(a) — the P-V phase diagram for water is 
somewhat different because the density of the liquid phase is larger than the density of 
the solid ice phase; see Problems 4.15 and 4.20. At P — 1 atm, water and water vapor coex¬ 
ist at T — 100°C, the “boiling point’’ — while boiling is a nonequilibrium process, boiling 
begins at the temperature at which the equilibrium vapor pressure is equal to the local 
atmospheric pressure. Consider a two-phase sample of water and water vapor at T = 99 °C. 
A two-phase sample containing both liquid water and water vapor is easy to create in a 
constant volume assembly. If there is sufficient volume available, liquid water will evapo¬ 
rate until the water vapor pressure reaches the coexistence pressure at that temperature 
P a ( 99°C) = 0.965 atm. If the applied pressure is then increased to, and held constant at, 
P— 1 atm while maintaining a constant temperature of T = 99 C, the system will be out 
of equilibrium. At constant pressure, the system will return to equilibrium by decreasing 
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its volume as water vapor condenses into the liquid phase until the the system is com¬ 
pletely liquid water. This lowers the Gibbs free energy until it has the equilibrium value 
determined by the chemical potential of liquid water at this pressure and temperature. 

On the other hand, if T = 100 °C and P= 1 atm, the chemical potentials of the liquid 
and vapor phases are equal, so any combination of water vapor and liquid water has the 
same Gibbs free energy. The proportion of water and vapor will change as heat is added 
or removed. The latent heat of vaporization of water L v = 540 cal/g = 2260 kj/kg is the heat 
needed to convert liquid into vapor. 

The coexistence pressure P a (T) defines the phase boundary between any two phases 
in the P-T plane, as shown in Figure 4.2(a). From equation (3), the coexistence pressure 
obeys 


HA(.P a (T),T) = HB(Pa(T),T). 


(4) 


The derivatives of the chemical potentials are related by 

/ 9ma\ / 9//a\ dP n _ / 9mb\ / 9mb \ dP a 
\ dT )p \ dP ) T dT ~\dT )p\ dP ) T dT ’ 

while the entropy per particle 5 = S/N and specific volume v—V/N are given by 


5 = 



V = 



(6a) 

(6b) 


see equation (4.5.6). Equations (5) and (6) give the Clausius-Clapeyron equation 

dPg _ s B -s A _ As _ L 
dT v B — v A Av TAv’ 

where L — T As is the latent heat per particle. The slope of the coexistence curve depends 
on the discontinuities of the entropy per particle and the volume per particle. Equation (7) 
applies very generally to all first-order phase transitions and can be used to determine 
the coexistence curve as a function of temperature; see Section 4.4, Problems 4.11, and 
4.14 through 4.16. 

At a triple point, the chemical potentials of three phases are equal: 


MA = MB = MC- (6) 

The slopes of the three coexistence lines that define the triple point are related since 
Asab + Asbc + Asg 4 = 0 and A vab + A v B c + A^ca = 0. This guarantees that each coexis¬ 
tence line between two phases at the triple point “points into” the third phase; see 
Problem 4.i7. 
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Problems 


4.1. Show that the entropy of a system in the grand canonical ensemble can be written as 

S = -kJ2Pr,slnP r ,s, 

r,s 

where P riS is given by equation (4.1.9). 

4.2. In the thermodynamic limit (when the extensive properties of the system become infinitely large, 
while the intensive ones remain constant), the ^-potential of the system may be calculated by 
taking only the largest term in the sum 

OO 

X z N 'Q Nr (V,T). 

N r =o 


Verify this statement and interpret the result physically. 

4.3. A vessel of volume V a)l contains iV (0) molecules. Assuming that there is no correlation whatsoever 
between the locations of the various molecules, calculate the probability, P(N, V), that a region of 
volume V (located anywhere in the vessel) contains exactly N molecules. 

(a) Show that IV = N {0) p and (AAT)r.m.s. = [N i0) p(l - p)) 1 / 2 , where p= V/V i0) . 

(b) Show that if both Af (0) p and A/ 30 ' (1 — p) are large numbers, the function P(N, V ) assumes a 
Gaussian form. 

(c) Further, if p <sc 1 and N « AT* 0 ’, show that the function P(N, V) assumes the form of a Poisson 
distribution: 


P(N) = e~ N 


(N) n 

N\ 


4.4. The probability that a system in the grand canonical ensemble has exactly N particles is given by 


p(N) 


z n Qn(V,T ) 
Q(z,V,T) ' 


Verify this statement and show that in the case of a classical, ideal gas the distribution of particles 
among the members of a grand canonical ensemble is identically a Poisson distribution. Calculate 
the root-mean-square value of (AN) for this system both from the general formula (4.5.3) and from 
the Poisson distribution, and show that the two results are the same. 

4.5. Show that expression (4.3.20) for the entropy of a system in the grand canonical ensemble can also 
be written as 


S=k 



n,v 


4.6. Define the isobaric partition function 


i r°° 

Y N (P, T) = jsJ Qn(V, T)e-V pv dV. 


Show that in the thermodynamic limit the Gibbs free energy (4.7.1) is proportional to In Yn(P, T). 
Evaluate the isobaric partition function for a classical ideal gas and show that PV = NkT. [The 
factor of the cube of the thermal deBroglie wavelength, a 3 , serves to make the partition function 
dimensionless and does not contribute to the Gibbs free energy in the thermodynamic limit.] 

4.7. Consider a classical system of noninteracting, diatomic molecules enclosed in a box of volume V 
at temperature T. The Hamiltonian of a single molecule is given by 


H(ri,r 2 ,p v p 2 ) 


^<P? + P§>+i*ln-»*l a . 


Study the thermodynamics of this system, including the dependence of the quantity (r 2 2 ) on T. 
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4 . 8 . Determine the grand partition function of a gaseous system of “magnetic" atoms (with J =\ and 
g= 2) that can have, in addition to the kinetic energy, a magnetic potential energy equal to p bH 
or —fj-sH, depending on their orientation with respect to an applied magnetic field II. Derive an 
expression for the magnetization of the system, and calculate how much heat will be given off by 
the system when the magnetic field is reduced from H to zero at constant volume and constant 
temperature. 

4 . 9 . Study the problem of solid-vapor equilibrium (Section 4.4) by setting up the grand partition 
function of the system. 

4 . 10 . A surface with No adsorption centers has N{< No) gas molecules adsorbed on it. Show that the 
chemical potential of the adsorbed molecules is given by 

u = kTln -, 

(N 0 - N)a(T) 

where a(T) is the partition function of a single adsorbed molecule. Solve the problem by 
constructing the grand partition function as well as the partition function of the system. 

[Neglect the intermolecular interaction among the adsorbed molecules.] 

4.11 . Study the state of equilibrium between a gaseous phase and an adsorbed phase in a single¬ 
component system. Show that the pressure in the gaseous phase is given by the Langmuir 
equation 


P„ = -x (a certain function of temperature), 

1 — 0 

where 6 is the equilibrium fraction of the adsorption sites that are occupied by the adsorbed 
molecules. 

4 . 12 . Show that for a system in the grand canonical ensemble 

4 . 13 . Define a quantity/as 


J = E - Nfi = TS — PV. 


Show that for a system in the grand canonical ensemble 


(AJ) 2 = kT 2 C v + 


a-* 


2 

(AN) 2 . 


4 . 14 . Assuming that the latent heat of vaporization of water L v = 2260kJ/kg is independent of 
temperature and the specific volume of the liquid phase is negligible compared to the specific 
volume of the vapor phase, (Vapor = kT/P a (T), integrate the Clausius-Clapeyron equation (4.7.7) 
to obtain the coexistence pressure as a function of temperature. Compare your result to the 
experimental vapor pressure of water from the triple point to 200 °C. The equilibrium vapor 
pressure at 373 K is 101kPa= latm. 

4 . 15 . Assuming that the latent heat of sublimation of ice L s = 2500kJ/kg is independent of temperature 
and the specific volume of the solid phase is negligible compared to the specific volume of the 
vapor phase, (Vapor = kT/P„ ( T), integrate the Clausius-Clapeyron equation (4.7.7) to obtain the 
coexistence pressure as a function of temperature. Compare your result to the experimental vapor 
pressure of ice from T = 0 to the triple point. The equilibrium vapor pressure at the triple point is 
612 Pa. 

4 . 16 . Calculate the slope of the solid-liquid transition line for water near the triple point T = 273.16K, 
given that the latent heat of melting is 80 cal/g, the density of the liquid phase is 1.00 g/cm 3 , and 
the density of the ice phase is 0.92 g/cm 3 . Estimate the melting temperature at P = 100 atm. 
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4 . 17 . Show that the Clausius-Clapeyron equation (4.7.7) guarantees that each of the coexistence curves 
at the triple point of a material “points into” the third phase; for example, the slope of the 
solid-vapor coexistence line has a value in-between the slopes of the the the solid-liquid and 
liquid-vapor coexistence lines. 

4 . 18 . Sketch the P-V phase diagram for helium-4 using the sketch of the P-T phase diagram in 
Figure 4.3. 

4 . 19 . Derive the equivalent of the Clausius-Clapeyron equation (4.7.7) for the slope of the coexistence 
chemical potential as a function of temperature. Use the fact that the pressures P(/j. } T ) in two 
different phases are equal on the coexistence curve. 

4 . 20 . Sketch the P-T and P-V phase diagrams of water, taking into account the fact that the mass 
density of the liquid phase is larger than the mass density of the solid phase. 



Formulation of Quantum Statistics 


The scope of the ensemble theory developed in Chapters 2 through 4 is extremely general, 
though the applications considered so far were confined either to classical systems or to 
quantum-mechanical systems composed of distinguishable entities. When it comes to 
quantum-mechanical systems composed of indistinguishable entities, as most physical 
systems are, considerations of the preceding chapters have to be applied with care. One 
finds that in this case it is advisable to rewrite ensemble theory in a language that is more 
natural to a quantum-mechanical treatment, namely the language of the operators and 
the wavefunctions. Insofar as statistics are concerned, this rewriting of the theory may not 
seem to introduce any new physical ideas as such; nonetheless, it provides us with a tool 
that is highly suited for studying typical quantum systems. And once we set out to study 
these systems in detail, we encounter a stream of new, and altogether different, physical 
concepts. In particular, we find that the behavior of even a noninteracting system, such as 
the ideal gas, departs considerably from the pattern set by the classical treatment. In the 
presence of interactions, the pattern becomes even more complicated. Of course, in the 
limit of high temperatures and low densities, the behavior of all physical systems tends 
asymptotically to what we expect on classical grounds. In the process of demonstrating 
this point, we automatically obtain a criterion that tells us whether a given physical sys¬ 
tem may or may not be treated classically. At the same time, we obtain rigorous evidence in 
support of the procedure, employed in the previous chapters, for computing the number, 
T, of microstates (corresponding to a given macrostate) of a given system from the vol¬ 
ume, to, of the relevant region of its phase space, namely T ~ co/hf, where/ is the number 
of “degrees of freedom” in the problem. 


5.1 Quantum-mechanical ensemble theory: 
the density matrix 

We consider an ensemble of <N identical systems, where W » 1. These systems are char¬ 
acterized by a (common) Hamiltonian, which may be denoted by the operator H. At time 
t, the physical states of the various systems in the ensemble will be characterized by 
the wavefunctions /(r,-,f), where r ; denote the position coordinates relevant to the sys¬ 
tem under study. Let / fc (r,-, t) denote the (normalized) wavefunction characterizing the 
physical state in which the fcth system of the ensemble happens to be at time /; natu¬ 
rally, k— 1,2,..., W. The time variation of the function / fc (t) will be determined by the 
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Schrodinger equation 1 


Hf k (t) = ihijr k (t). 


( 1 ) 


Introducing a complete set of orthonormal functions <p n , the wavefunctions \[r k (t) may be 
written as 

Vr fc (r) = ^a£(t)</>„, (2) 

n 

where 

a k (t) = f <p* n ^ k (t)dr-, (3) 

here, <p* denotes the complex conjugate of </>„ while dr denotes the volume element of 
the coordinate space of the given system. Clearly, the physical state of the kth system can 
be described equally well in terms of the coefficients a k (t). The time variation of these 
coefficients will be given by 

iha k (t ) = ih J 0*i/f fc (t)dr = J 4>^H\l/ k (t)dr 

^ k m > 

— y (4) 

m 

where 

Hum — / <t>* n H(t> m dr. (5) 


The physical significance of the coefficients a k ( t) is evident from equation (2) . They are 
the probability amplitudes for the various systems of the ensemble to be in the various 
states </>„; to be practical, the number \a k (L)\ 2 represents the probability that a measure¬ 
ment at time t finds the /cth system of the ensemble to be in the particular state </>„. Clearly, 
we must have 


y, | a k (t)\ 2 = 1 (for all k). (6) 

n 

We now introduce the density operator p(t), as defined by the matrix elements 

Pmnjt) = ^a k m (t)a k * (t) |; (7) 

k= 1 

clearly, the matrix element p mn {t ) is the ensemble average of the quantity a m (t)a* n (t), 
which, as a rule, varies from member to member in the ensemble. In particular, the 
diagonal element p nn {t) is the ensemble average of the probability \a n (l)\ 2 , the latter 

1 For simplicity of notation, we suppress the coordinates r,- in the argument of the wavefunction 1 jr k . 
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itself being a (quantum-mechanical) average. Thus, we encounter here a double-averaging 
process — once due to the probabilistic aspect of the wavefunctions and again due to the 
statistical aspect of the ensemble. The quantity p nn (t) now represents the probability that 
a system, chosen at random from the ensemble, at time t, is found to be in the state </>„. In 
view of equations (6) and (7), 

y Pnn = 1 * ( 8 ) 

n 


We shall now determine the equation of motion for the density matrix p m n(t). We 
obtain, with the help of the foregoing equations, 


ihpmnit) = + 

A:=l 


k=l L ' l 


— y '. { H m lPlnd ) Pml(t)Hi n 
l 

= (Hp — pH)mn) 


(9) 


here, use has been made of the fact that, in view of the Hermitian character of the operator 
H,H*j = H[ n . Using the commutator notation, equation (9) may be written as 

ihp = [H, /5]_. (10) 


Equation (10) is the quantum-mechanical analog of the classical equation (2.2.10) of 
Liouville. As expected in going from a classical equation of motion to its quantum- 
mechanical counterpart, the Poisson bracket [p,H] has given place to the commutator 

(pH — Hp)/iH. 

If the given system is known to be in a state of equilibrium, the corresponding ensemble 
must be stationary, that is, p mn = 0. Equations (9) and (10) then tell us that, for this to be the 
case, (i) the density operator p must be an explicit function of the Hamiltonian operator H 
(for then the two operators will necessarily commute) and (ii) the Hamiltonian must not 
depend explicitly on time, that is, we must have (i ) p = p (H) and (ii) H = 0. Now, if the basis 
functions <p n were the eigenfunctions of the Hamiltonian itself, then the matrices H and 
p would be diagonal: 


Hmn — En^n 


Pmn — Pn&mn- 


(ll) 2 


2 It may be noted that in this (so-called energy) representation the density operator p may be written as 

P Pn(<t>n\, 

n 


( 12 ) 


for then 


Pkl — £<0d«Pn(0nl*> — X>„PA/ — Pk&kh 
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The diagonal element p n , being a measure of the probability that a system, chosen at ran¬ 
dom (and at any time) from the ensemble, is found to be in the eigenstate <j > n , will naturally 
depend on the corresponding eigenvalue E n of the Hamiltonian; the precise nature of this 
dependence is, however, determined by the “kind” of ensemble we wish to construct. 

In any representation other than the energy representation, the density matrix may or 
may not be diagonal. However, quite generally, it will be symmetric: 


(13) 


Pmn — Pnm ■ 


The physical reason for this symmetry is that, in statistical equilibrium, the tendency of a 
physical system to switch from one state (in the new representation) to another must be 
counterbalanced by an equally strong tendency to switch between the same states in the 
reverse direction. This condition of detailed balancing is essential for the maintenance of 
an equilibrium distribution within the ensemble. 

Finally, we consider the expectation value of a physical quantity G, which is dynami¬ 
cally represented by an operator G. This will be given by 



(14) 


In terms of the coefficients a k , 


l 

<G> = ^E El a n* a m G i 


r nm 


(15) 


where 



(16) 


Introducing the density matrix p, equation (15) becomes 

(G) = E PmnGnm — E(pG) mm — Tr(pG). 


(17) 


m 


Taking G = f, where 1 is the unit operator, we have 


Tr(p) = 1 


(18) 


which is identical to (8). It should be noted here that if the original wavefunctions \j/ k were 
not normalized then the expectation value (G) would be given by the formula 



( 19 ) 
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instead. In view of the mathematical structure of formulae (17) and (19), the expectation 
value of any physical quantity G is manifestly independent of the choice of the basis {</>„}, 
as it indeed should be. 


5.2 Statistics of the various ensembles 

5.2.A The microcanonical ensemble 

The construction of the microcanonical ensemble is based on the premise that the sys¬ 
tems constituting the ensemble are characterized by a fixed number of particles N, a fixed 
volume V, and an energy lying within the interval (E— \ A,E+ \ A), where A <$: E. The 
total number of distinct microstates accessible to a system is then denoted by the sym¬ 
bol r(N, V, E; A) and, by assumption, any one of these microstates is as likely to occur as 
any other. This assumption enters into our theory in the nature of a postulate and is often 
referred to as the postulate of equal a priori probabilities for the various accessible states. 

Accordingly, the density matrix p mn (which, in the energy representation, must be a 
diagonal matrix) will be of the form 


Pmn — Pn&mni 


( 1 ) 


with 


pn 


1/ r for each of the accessible states, 
0 for all other states; 


( 2 ) 


the normalization condition (5.1.18) is clearly satisfied. As we already know, the thermody¬ 
namics of the system is completely determined from the expression for its entropy which, 
in turn, is given by 


S=k lnr. (3) 

Since T, the total number of distinct, accessible states, is supposed to be computed 
quantum-mechanically, taking due account of the indistinguishability of the particles right 
from the beginning, no paradox, such as Gibbs’, is now expected to arise. Moreover, if 
the quantum state of the system turns out to be unique (T = 1), the entropy of the sys¬ 
tem will identically vanish. This provides us with a sound theoretical basis for the hitherto 
empirical theorem of Nernst (also known as the third law of thermodynamics). 

The situation corresponding to the case T = 1 is usually referred to as a pure case. In 
such a case, the construction of an ensemble is essentially superfluous, because every sys¬ 
tem in the ensemble has got to be in one and the same state. Accordingly, there is only one 
diagonal element p nn that is nonzero (actually equal to unity), while all others are zero. The 
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density matrix, therefore, satisfies the relation 

p 2 = p- (4) 


In a different representation, the pure case will correspond to 


1 ^ 

Pmn = — ^2 a m a n* = a m a *n 


(5) 


k= 1 


because all values of k are now literally equivalent. We then have 

Pmn = y PmlPln = ^ . a m a i a l a n 


because = 1 


— Pmn- 


( 6 ) 


Relation (4) thus holds in all representations. 

A situation in which r > 1 is usually referred to as a mixed case. The density matrix, in 
the energy representation, is then given by equations (1) and (2). If we now change over to 
any other representation, the general form of the density matrix should remain the same, 
namely (i) the off-diagonal elements should continue to be zero, while (ii) the diagonal 
elements (over the allowed range) should continue to be equal to one another. Now, had 
we constructed our ensemble on a representation other than the energy representation 
right from the beginning, how could we have possibly anticipated ab initio property (i) 
of the density matrix, though property (ii) could have been easily invoked through a pos¬ 
tulate of equal a priori probabilities ? To ensure that property (i), as well as property (ii), 
holds in every representation, we must invoke yet another postulate, namely the postulate 
of random a priori phases for the probability amplitudes a k j, which in turn implies that 
the wavefunction \// k , for all k, is an incoherent superposition of the basis { (j> n }. As a con¬ 
sequence of this postulate, coupled with the postulate of equal a priori probabilities, we 
would have in any representation 


Pmn = 


, e/V , e/V 


«/V 


k= 1 


M 

W 

k= 1 

= cie‘(Om-rt) 


= CS 


mn> 


(7) 


as it should be for a microcanonical ensemble. 

Thus, contrary to what might have been expected on customary grounds, to secure the 
physical situation corresponding to a microcanonical ensemble, we require in general two 
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postulates instead of one! The second postulate arises solely from quantum-mechanics 
and is intended to ensure noninterference (and hence a complete absence of correlations) 
among the member systems; this, in turn, enables us to form a mental picture of each 
system of the ensemble, one at a time, completely disentangled from other systems. 

5.2.B The canonical ensemble 

In this ensemble the macrostate of a member system is defined through the parameters 
N, V, and T ; the energy E is now a variable quantity. The probability that a system, chosen 
at random from the ensemble, possesses an energy E r is determined by the Boltzmann 
factor exp (—/?£>), where ft — 1/kT; see Sections 3.1 and 3.2. The density matrix in the 
energy representation is, therefore, taken as 


Pmn — Pn&mri) 


( 8 ) 


with 


p n = Cexp {-ftE n y, n = 0,1,2,... 


(9) 


The constant C is determined by the normalization condition (5.1.18), whereby 

C= 1 1 


Jftexp(—ftE n ) Qn(P) ’ 


( 10 ) 


where Qn{P) is the partition function of the system. In view of equations (5.1.12), see 
footnote 2, the density operator in this ensemble may be written as 


p = y~'|</>n)- e P En (<t>n\ 

v Qn(P) 


p -PH 




Qn(P) 

1 -e-^= e 


Qn(P) Tr (e-P* 1 ) ’ 


( 11 ) 


for the operator Y^n\fn){<Pn\ is identically the unit operator. It is understood that the 
operator exp (~ftH) in equation (11) stands for the sum 


^(-ty 


; (ftH)j 


7=0 


]■ 


( 12 ) 
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The expectation value (G)jv of a physical quantity G, which is represented by an operator 
G, is now given by 

(G)n = Tr (pG) = — Tr(Ge~ 

Qn(p) 

Tr(Ge~PH) 

= — 1 (13) 
Tr(e-^) 

the suffix N here emphasizes the fact that the averaging is being done over an ensemble 
with N fixed. 


fSHj 


5.2.C The grand canonical ensemble 

In this ensemble the density operator p operates on a Hilbert space with an indefi¬ 
nite number of particles. The density operator must therefore commute not only with 
the Hamiltonian operator H but also with a number operator h whose eigenvalues are 
0,1,2,.... The precise form of the density operator can now be obtained by a straightfor¬ 
ward generalization of the preceding case, with the result 


P = 


1 -P(H-uh) 
&(p,V,T) 


(14) 


where 


Q(p, V, T) = = Tr!e -/j(h- M n)j_ (15) 

r,s 

The ensemble average (G) is now given by 

(G) =---TrlGfT^e^") 

' ' a(p,v,T) y ’ 

oo 

E z N (G) nQn(P) 

= — -, (16) 

E z n Qn(P) 

N =0 

where z(= e^) is the fugacity of the system while (G)jv is the canonical-ensemble average, 
as given by equation (13). The quantity d(/x, V, T) appearing in these formulae is, clearly, 
the grand partition function of the system. 

5.3 Examples 

5.3.A An electron in a magnetic field 

We consider, for illustration, the case of a single electron that possesses an intrinsic spin 
\ha and a magnetic moment pg, where a is the Pauli spin operator and ps — eh/2mc. 
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The spin of the electron can have two possible orientations, f or with respect to an 
applied magnetic field B. If the applied field is taken to be in the direction of the z-axis, the 
configurational Hamiltonian of the spin takes the form 

H = -h b (<t ■ B) = —fi B Ba z . (1) 


In the representation that makes <r z diagonal, namely 


0 -A 
i 0/ ’ 


the density matrix in the canonical ensemble would be 

( e -/3U) 

(P)=~ - — 

Tr(e-^) 


1 0 
0 -1 


e^ l B B + e P^b b 


e Pl i b b )' 


We thus obtain for the expectation value a z 

e Pfl B B _ e ~Pf*B B 

(o z ) = Tr(p<y z ) = ——- - = tanh (4) 

e P^B n _|_ e ppb d 

in perfect agreement with the findings of Sections 3.9 and 3.10. 

5.3.B A free particle in a box 

We now consider the case of a free particle, of mass m, in a cubical box of side L. The 
Hamiltonian of the particle is given by 

■ T h 2 o h 2 ( 9 2 a 2 9 2 \ 

H — -V 2 =-I — H-H-), (5) 

2 m 2 m y 9 x 2 9 y 2 dz 2 J 

while the eigenfunctions of the Hamiltonian that satisfy periodic boundary conditions, 


are given by 


<t>(x + L,y,z) = <p(x,y + L,z) = <t>(x,y,z + Q 
= <p(x,y,z), 


= J3j2 ex P {ik ' r) ’ 


the corresponding eigenvalues E being 


(8) 
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and the corresponding wave vector 1c being 

k = (k x , k y ,k z )=^-(n x ,n y , n z ); (9) 

the quantum numbers n x , n y , and n z must be integers (positive, negative, or zero). 
Symbolically, we may write 

2n 

k = —n, (10) 

where n is a vector with integral components 0, ±1, ±2,.... 

We now proceed to evaluate the density matrix (p) of this system in the canonical 
ensemble; we shall do so in the coordinate representation. In view of equation (5.2.11), 
we have 


{r\e~P H \r') = ^(r|E)e" /i£ (£'|r') 

E 

= J2 e ~ fiE ^ E ^ r) ^ r 'y 


at) 


Substituting from equation (7) and making use of relations (8) and (10), we obtain 


(r\e P H \r') = j^J2 ex P 
k 

“c&J 

= ( ———^ 
\2nfiti 2 ) 


fth 2 n , 

--—kr + ilc ■ (r - r) 
2 m 


exp 

3/2 


P h2 a 


2m 


k z + ik • (r - r') 


d z k 


exp 


./1 2 


2 ph 2 


\r — r\ 


see equations (B.41) and (B.42) in Appendix B. It follows that 

Tr(e“^) = J (r\e~^\r)d 3 r 


= V 


m \ 


3/2 


2nf)h 2 ) 


( 12 ) 


(13) 


The expression in equation (13) is indeed the partition function, Qi(/8), of a single particle 
confined to a box of volume V; see equation (3.5.19). Dividing (12) by (13), we obtain for 
the density matrix in the coordinate representation 


<r|p|r') = - exp 


V *1 2,6 ft 2 


|r-rf 


(14) 


As expected, the matrix p r y is symmetric between the states r and r . Moreover, the 
diagonal element (r|p|r), which represents the probability density for the particle to be in 
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the neighborhood of the point r, is independent of r; this means that, in the case of a sin¬ 
gle free particle, all positions within the box are equally likely to obtain. A nondiagonal 
element (r|p|r'>, on the other hand, is a measure of the probability of “spontaneous tran¬ 
sition” between the position coordinates r and r' and is therefore a measure of the relative 
“intensity” of the wave packet (associated with the particle) at a distance r — r' from the 
center of the packet. The spatial extent of the wave packet, which is a measure of the uncer¬ 
tainty involved in locating the position of the particle, is clearly of order h/(mkT )b 2 ; the 
latter is also a measure of the mean thermal wavelength of the particle. The spatial spread 
found here is a purely quantum-mechanical effect; quite expectedly, it tends to vanish at 
high temperatures. In fact, as /3 0, the behavior of the matrix element (14) approaches 

that of a delta function, which implies a return to the classical picture of a point particle. 

Finally, we determine the expectation value of the Hamiltonian itself. From equa¬ 
tions (5) and (14), we obtain 


(H) = Tr (Hp) = - 


h 2 

2 mV 


/I 


V 2 exp 


m 
2/3 ft 2 


\r — r'\ 2 


cPr 


*-W' r ~ A 


-WvS\ 

3 3 

= — = -kT, 

2/3 2 

which was indeed expected. Otherwise, too, 


exp 


./ ,2 


2 ph 2 


\r—r\ 


err 


(15) 


= Jl (He ^ ) = _Ai n xr(e“^) 

Tr{e-P«) 3/8 

which, on combination with (13), leads to the same result. 


(16) 


5.3.C A linear harmonic oscillator 

Next, we consider the case of a linear harmonic oscillator whose Hamiltonian is given by 


A h 3 1 2 2 

H =-=- + - mco q , 

2m dq 2 2 ^ 


(17) 


with eigenvalues 


En — 



n= 0,1,2,... 


(18) 


and eigenfunctions 



1/4 


c ~q/2)s 2 

(2" h !) 1 / 2 


(19) 
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where 


and 


M-) i 


H n & = (- 1)V 2 



(20) 


( 21 ) 


The matrix elements of the operator exp (—/3H) in the ^-representation are given by 


{q\e f tH \q') = Y2e >iEn <p„(q)<p n (q') 

n=0 

_ /WtoQ/ 2 ( i/ 2 )( ? 2 +f' 2 )y- | c -(n+l/2)flhaj Hn(M )H n (f) 

v nh ) I 2 n n\ 

n =0 1 


The summation over n is somewhat difficult to evaluate; nevertheless, the final result is 3 


(q\e pH \q 1 ) = 


27rfi,sinh(/lfi,a>) 

(q + q') 2 tanh 


x exp 


miD 

' 4ft~ 


phw\ 

2 J 


+ (q — q') 2 coth 


pha>\ 

2 J 


(23) 


which gives 


OO 

Tr(e“^) = J {q\e~ pH \q)dq 


OO 


2jrftsinh(/5fi,ft)) 

1 


2 sinh ^ j jS 


"| 1/2 

] GXP 

— OO 

e -(l/2)Pha> 

1 — ' 




dq 


(24) 


Expression (24) is indeed the partition function of a linear harmonic oscillator; see 
equation (3.8.14). At the same time, we find that the probability density for the oscillator 
coordinate to be in the vicinity of the value q is given by 


(q\p\q) = 


mco tanh (jiPhuA 


nh 


1/2 


exp 


incoq 2 

h 



(25) 


3 The mathematical details of this derivation can be found in Kubo (1965, pp. 175-177). 
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we note that this is a Gaussian distribution in q , with mean value zero and root-mean- 
square deviation 


^r.m.s. — 


h 


2mwtanh 


(z0fc“)_ 


1/2 


(26) 


The probability distribution (25) was first derived by Bloch in 1932. In the classical limit 
(Phco << 1), the distribution becomes purely thermal — free from quantum effects: 


/ m„f \' ' 2 

eXP 


9 ? 

mcoq 

2kT 


(27) 


with dispersion (kT/mco 2 ) 1 ^ 2 . At the other extreme 1), the distribution becomes 

purely quantum-mechanical — free from thermal effects: 

{q\p\q )«exp 

with dispersion ( h/2ma> l 1 / 2 . Note that the limiting distribution (28) is precisely the one 
expected for an oscillator in its ground state (n = 0), that is one with probability density 
</>o(g); see equations (19) through (21). 

In view of the fact that the mean energy of the oscillator is given by 


mcoq c 


(28) 


(H) = lnTr(e 


(29) 


we observe that the temperature dependence of the distribution (25) is solely determined 
by the expectation value (H) . Actually, we can write 


iq\p\q) = 


mr 
2 7i (H) 


v 1/2 


j exp 

mco z q^ 

2(H) _ 


(30) 


with 


q r.m.s. 



(31) 


It is now straightforward to see that the mean value of the potential energy (^mco 2 q 2 ) of 
the oscillator is \ (H); accordingly, the mean value of the kinetic energy {p 2 /2m) will also 
be the same. 
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5.4 Systems composed of indistinguishable 
particles 

We shall now formulate the quantum-mechanical description of a system of N identical 
particles. To fix ideas, we consider a gas of noninteracting particles; the findings of this 
study will be of considerable relevance to other systems as well. 

Now, the Hamiltonian of a system of N noninteracting particles is simply a sum of the 
individual single-particle Hamiltonians: 


N 

H(q,p) = ’*rHi(q i ,Pi); ( 1 ) 

i=i 

here, (qi,pi) are the coordinates and momenta of the ith particle while Ft, is its Hamilto¬ 
nian. 4 Since the particles are identical, the Hamiltonians Hi(i= 1,2,..., AF) are formally the 
same; they only differ in the values of their arguments. The time-independent Schrodinger 
equation for the system is 


tffeW = £fe(<7), (2) 

where E is an eigenvalue of the Hamiltonian and fn(q) the corresponding eigenfunction. 
In view of (1), we can write a straightforward solution of the Schrodinger equation, namely 

N 

few=npi 

i=i 


with 


N 

E=J2 e ‘ ] (4) 

i= l 

the factor u £i (qi) in (3) is an eigenfunction of the single-particle Hamiltonian Hi(qi,pi), 
with eigenvalue ef. 


HiU Si (qi) — SiU Si {qi). ( 5 ) 

Thus, a stationary state of the given system may be described in terms of the single-particle 
states of the constituent particles. In general, we may do so by specifying the set of num¬ 
bers {nf to represent a particular state of the system; this would imply that there are »/ 
particles in the eigenstate characterized by the energy value e/. Clearly, the distribution set 

4 We are studying here a single-component system composed of “spinless” particles. Generalization to a system 
composed of particles with spin and to a system composed of two or more components is quite straightforward. 
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{iij} must conform to the conditions 


J2 n i = N 

i 

and 


( 6 ) 


^2 met = E. ( 7 ) 

i 

Accordingly, the wavefunction of this state may be written as 


Tl\-\-Yl2 

fn(q) =n U](m) n u 2(m)..., ( 8 ) 

m= 1 m=n\+l 

where the symbol Ui(m) stands for the single-particle wavefunction u f j (q m ). 

Now, suppose we effect a permutation among the coordinates appearing on the right 
side of (8); as a result, the coordinates (1,2, ...,N) get replaced by ( P1,P2 ,.. ,,PN), say. The 
resulting wavefunction, which we may call P^(q), will be 

Hi 7 * 1 + 722 

pfn(q) = n u ' {pm) n u 2.( pm )■■■■ o) 

772=1 722=721 + 1 


In classical physics, where the particles of a given system, even though identical, are 
regarded as mutually distinguishable, any permutation that brings about an interchange 
of particles in two different single-particle states is recognized to have led to a new, physi¬ 
cally distinct, microstate of the system. For example, classical physics regards a microstate 
in which the so-called 5th particle is in the state m and the so-called 7th particle in the 
state Uj(j / i ) as distinct from a microstate in which the 7th particle is in the state zq and 
the 5th particle in the state uj. This leads to 


n\\n2\... 

(supposedly distinct) microstates of the system, corresponding to a given mode of distri¬ 
bution {ni}. The number (10) would then be ascribed as a “statistical weight factor” to 
the distribution set {«/}. Of course, the “correction” applied by Gibbs, which has been 
discussed in Sections 1.5 and 1.6, reduces this weight factor to 


W c {m) 


1 

«i!w 2 !...' 


( 11 ) 


And the only way one could understand the physical basis of that “correction” was in terms 
of the inherent indistinguishability of the particles. 

According to quantum physics, however, the situation remains unsatisfactory even 
after the Gibbs correction has been incorporated, for, strictly speaking, an interchange 
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among identical particles, even if they are in different single-particle states, should not 
lead to a new microstate of the system! Thus, if we want to take into account the indistin- 
guishability of the particles properly, we must not regard a microstate in which the “5th” 
particle is in the state Ui and the “7th” in the state uj as distinct from a microstate in which 
the “7th” particle is in the state Uj and the “5th” in the state Uj (even if i / j), for the labeling 
of the particles as No. 1, No. 2, and so on (which one often resorts to) is at most a matter of 
convenience — it is not a matter of reality. In other words, all that matters in the descrip¬ 
tion of a particular state of the given system is the set of numbers n, that tell us how many 
particles there are in the various single-particle states uf, the question, “which particle is 
in which single-particle state?” has no relevance at all. 

Accordingly, the microstates resulting from any permutation P among the N parti¬ 
cles (so long as the numbers n, remain the same) must be regarded as one and the 
same microstate. For the same reason, the weight factor associated with a distribution set 
{n,}, provided that the set is not disallowed on some other physical grounds, should be 
identically equal to unity, whatever the values of the numbers n, may be: 


Wq{Ui) = 1 . ( 12) 5 

Indeed, if for some physical reason the set {«*■} is disallowed, the weight factor W tj for that 
set should be identically equal to zero; see, for instance, equation (19). 

At the same time, a wavefunction of the type (8), which we may call Boltzmannian 
and denote by the symbol Vusoiiz^/). is inappropriate for describing the state of a system 
composed of indistinguishable particles because an interchange of arguments among the 
factors Ui and Uj, where i ^ j, would lead to a wavefunction that is both mathematically and 
physically different from the one we started with. Now, since a mere interchange of the 
particle coordinates must not lead to a new microstate of the system, the wavefunction 
i l/E(q) must be constructed in such a way that, for all practical purposes, it is insensitive 
to any interchange among its arguments. The simplest way to do this is to set up a lin¬ 
ear combination of all the Nl functions of the type (9) that obtain from (8) by all possible 
permutations among its arguments; of course, the combination must be such that if a per¬ 
mutation of coordinates is carried out in it, then the wavefunctions t// and hi,// must satisfy 
the property 


\Pif\ 2 = \f\ 2 . 


(13) 


This leads to the following possibilities: 


Pt/r = i/f for all P, (14) 

5 It may be mentioned here that as early as in 1905 Ehrenfest pointed out that to obtain Planck’s formula for the 
black-body radiation one must assign equal a priori probabilities to the various distribution sets {n{\. 
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which means that the wavefunction is symmetric in all its arguments, or 


P\[r 


■ 


if P is an even permutation, 
if P is an odd permutation, 


(15) 6 


which means that the wavefunction is antisymmetric in its arguments. We call these 
wavefunctions i jrs and \I/a, respectively; their mathematical structure is given by 


fsW = const. ^PiAboIizW 
p 


(16) 


and 


f A (<j0 = const. 8 P Pf Boitz (q), (17) 

p 

where Sp in the expression for i/t 4 is +1 or — 1 according to whether the permutation P is 
even or odd. 

We note that the function can be written in the form of a Slater determinant: 


M((l) M,(2) ••• Ui(N) 
Uj( 1) Uj( 2) Uj(N) 


\I/A(q) = const. 


(18) 


«z(l) ui( 2) U i(N) 


where the leading diagonal is precisely the Boltzmannian wavefunction while the other 
terms of the expansion are the various permutations thereof; positive and negative signs in 
the combination (17) appear automatically as we expand the determinant. On interchang¬ 
ing a pair of arguments (which amounts to interchanging the corresponding columns of 
the determinant), the wavefunction i/m merely changes its sign, as it indeed should. How¬ 
ever, if two or more particles happen to be in the same single-particle state, then the 
corresponding rows of the determinant become identical and the wavefunction vanishes. 7 
Such a state is physically impossible to realize. We therefore conclude that if a system com¬ 
posed of indistinguishable particles is characterized by an antisymmetric wavefunction, 

6 An even (odd) permutation is one that can be arrived at from the original order by an even (odd) number of “pair 
interchanges" among the arguments. For example, of the six permutations 

(1,2,3), (2,3,1), (3,1,2), (1,3,2), (3,2,1), and (2,1,3), 

of the arguments 1, 2, and 3, the first three are even permutations while the last three are odd. A single interchange, 
among any two arguments, is clearly an odd permutation. 

7 This is directly related to the fact that if we effect an interchange among two particles in the same single-particle 
state, then Pip a will obviously be identical to %pA- At the same time, if we also have P\pA — — i Pa, then ip a must be identically 


zero. 
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then the particles of the system must all be in different single-particle states — a result 
equivalent to Pauli’s exclusion principle for electrons. 

Conversely, a statistical system composed of particles obeying an exclusion principle 
must be described by a wavefunction that is antisymmetric in its arguments. The statistics 
governing the behavior of such particles is called Fermi-Dirac, or simply Fermi; statistics 
and the constituent particles themselves are referred to as fermions. The statistical weight 
factor Wf.d.{«i) for such a system is unity so long as the n, in the distribution set are either 
0 or 1; otherwise, it is zero: 


Wf.d .{«/} 


1 if J2 n? = N, 
i 

0 if £»? > N. 
i 


(19) 8 9 


No such problems arise for systems characterized by symmetric wavefunctions: in partic¬ 
ular, we have no restriction whatsoever on the values of the numbers n,-. The statistics 
governing the behavior of such systems is called Bose-Einstein, or simply Bose, statis¬ 
tics and the constituent particles themselves are referred to as bosons , 9 The weight factor 
Wb.e. 1 «/} for such a system is identically equal to 1, whatever the values of the numbers np 

Wb.e.M = 1 ; n< = 0,1,2,.... (20) 

It should be pointed out here that there exists an intimate connection between the 
statistics governing a particular species of particles and the intrinsic spin of the particles. 
For instance, particles with an integral spin (in units of h, of course) obey Bose-Einstein 
statistics, while particles with a half-odd integral spin obey Fermi-Dirac statistics. Exam¬ 
ples in the first category are photons, phonons, jr-mesons, gravitons, He 4 -atoms, and so 
on, while those in the second category are electrons, nucleons (protons and neutrons), 
p -mesons, neutrinos, He 3 -atoms, and so on. 

Finally, it must be emphasized that, although we have derived our conclusions here 
on the basis of a study of noninteracting systems, the basic results hold for interacting 
systems as well. In general, the desired wavefunction i jr(q) will not be expressible in terms 
of the single-particle wavefunctions ui{q m ); nonetheless, it will have to be either of the 
kind ifrs(q), satisfying equation (14), or of the kind satisfying equation (15). 

8 Note that the condition YUrf =tV would be implies that all n, are either 0 or 1. On the other hand, if any of the n 
are greater than 1, the sum Yi rf would be greater than N. 

9 Possibilities other than Bose-Einstein and Fermi-Dirac statistics can arise in which the wavefunction changes by a 
complex phase factor e‘ 9 when particles are interchanged. For topological reasons, this can only happen in two dimen¬ 
sions. Quasiparticle excitations with this property are called anyons and, if 6 is a rational fraction (other than 1 or 1/2) 
of 2 n, are said to have fractional statistics and they play an important role in the theory of the fractional quantum Hall 
effect; see Wilczek (1990) andEzawa (2000). 
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5.5 The density matrix and the partition 
function of a system of free particles 

Suppose that the given system, which is composed of N indistinguishable, noninteracting 
particles confined to a cubical box of volume V, is a member of a canonical ensemble 
characterized by the temperature parameter ft. The density matrix of the system in the 
coordinate representation will be 10 

(ri,...,rjv|p|r' 1 ,...,/j V > = 1 (n,..., r N |e~* ii V 1 ,...,r' N ), (1) 

Qn(p) 

where Qn(P) is the partition function of the system: 

Qn(P)= Tr(e~ p6 ) = f (r 1 ,...,r N \e-^\r 1 ,...,r N )d 3N r. (2) 

For brevity, we denote the vector r, by the letter i and the primed vector r'. by i!. Further, 
let t/t£(1, ..., N) denote the eigenfunctions of the Hamiltonian, the suffix E representing the 
corresponding eigenvalues. We then have 

(1,.. .,N\e~ ptl \V, = J2 e ~ PE [>r?(b ■ ■ • >iV)^(T, • • .,iV')], (3) 

E 

where the summation goes over all possible values of E; compare to equation (5.3.if). 

Since the particles constituting the given system are noninteracting, we may express 
the eigenfunctions t/r£(l,...,iV) and the eigenvalues E in terms of the single-particle 
wavefunctions uftm) and the single-particle energies £/. Moreover, we find it advisable to 
work with the wave vectors fc, rather than the energies e,-; so we write 

E= ^ = 2 ^( / H + ^ + -" + fc w)’ (4) 

where the k t on the right side are the wave vectors of the individual particles. Imposing 
periodic boundary conditions, the normalized single-particle wavefunctions are 

u k (r) = 1/2 exp {z(fc • r)}, (5) 


with 


/c = 2jrV” 1/3 w; (6) 

here, n stands for a three-dimensional vector whose components have values 0, ±1, ±2,.... 
The wavefunction f of the total system would then be, see equations (5.4.16) 


10 For a general survey of the density matrix and its applications, see ter Haar (1961). 
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and (5.4.17), 


fed,. ■ • ,N) = 0V!)- 1/2 ^5pP{u fcl (1)... u fcjv (iV)}, (7) 

p 

where the magnitudes of the individual /q are such that 

(fc 2 + ■ ■ • + kf) = ^f 2 ■ (8) 

The number Sp in the expression for fe is identically equal to +1 if the particles are bosons; 
for fermions, it is +1 or —1 according to whether the permutation P is even or odd. Thus, 
quite generally, we may write 


8 P = (±l) lP] , ( 9 ) 

where [ P] denotes the order of the permutation; note that the upper sign in this expression 
holds for bosons while the lower sign holds for fermions. The factor (IV!) -1 / 2 has been 
introduced here to ensure the normalization of the total wavefunction. 

Now, it makes no difference to the wavefunction (7) whether the permutations P are 
carried out on the coordinates 1 , ...,N or on the wave vectors ki, . ..,1cn, because after all 
we are going to sum over all the N\ permutations. Denoting the permuted coordinates by 
PI,..., PN and the permuted wave vectors by Pk\, PIcn, equation (7) may be written as 


fed, • --,N) = (Nir^J^^piUk, (PI)... u kN (PN)} 
p 

= (AT!)-b 2 J2 8p{u Pk 1 (1) ... Up kN (AT)). 
P 


Equations (10a and 10b) may now be substituted into (3), with the result 


(1,... f N\e~^\ 1', = (N\ ) _1 ^ e -^ 2 ^ 2/2m 

K 


X 


!>{«*! Q,) ■ • ■ u Pk N (N ' )] 

p p 


(10a) 

(10b) 


( 11 ) 


where P and P are any of the Nl possible permutations. Now, since a permutation among 
the ki changes the wavefunction \[r at most by a sign, the quantity [i/fe*] in (11) is insen¬ 
sitive to such a permutation; the same holds for the exponential factor as well. The 
summation over K is, therefore, equivalent to (1 /AH) times a summation over all the 
vectors fci,..., /c ; y independently of one another. 

Next, in view of the Ai-fold summation over the ki, all the permutations P will make 
equal contributions toward the sum (because they differ from one another only in the 
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ordering of the fc/). Therefore, we may consider only one of these permutations, say the 
one for which Pic] =k\,.. .,P1 cn = fc/v (and hence Sp — 1 for both kinds of statistics), and 
include a factor of (TV!). The net result is: 


(l,...,N\e~ pp \l',...,N') = (AT !) -1 


e -f)h 2 <Jc 2 +■+k 2 l )/2m 


E S P I ( J>1 )“fc 1 d')} ■ ■ ■■ { u kN ( PN > < N W') I 


( 12 ) 


Substituting from (5) and noting that, in view of the largeness of V, the summations over 
the ki may be replaced by integrations, equation (12) becomes 


( 1 ,.. .,N\e~P p \V,.. ,,N') 

=- f e-^ l ^l 2 m+ik ^- V) d i k x ... 

N\{2n)™ ^ p [j 

e -ph 2 k%[/2m+ik N -(PN-N’) cp ^ 


/■ 

1 / m \3JV/2 

= m{ w#) 


)i, 


(13) 

(14) 


where 


H5) 

Here, use has been made of the mathematical result (5.3.12), which is clearly a special case 
of the present formula. 

Introducing the mean thermal wavelength, often referred to as the thermal deBroglie 
wavelength, 


h ( 2nB \ 1/2 

k= -- —= —- , (16) 

{2nmkT) 1 / 2 \ m ) 

and rewriting our coordinates as ri,...,rjv, the diagonal elements among (14) take the 
form 


{r 1} ...,r N \e pf, \r u ...,r N ) = -»T )... f(Pr N -r N )], 


(17) 


where 


fir) = exp (— nr 2 /X 2 ). 


(18) 
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To obtain the partition function of the system, we have to integrate (17) over all the 
coordinates involved. However, before we do that, we would like to make some observa¬ 
tions on the summation hirst of all, we note that the leading term in this summation, 
namely the one for which Pr, = n, is identically equal to unity (because /(0) = 1). This 
is followed by a group of terms in which only one pair interchange (among the coordi¬ 
nates) has taken place; a typical term in this group will be f(rj — rdfiri — rj) where i ± j. 
This group of terms is followed by other groups of terms in which more than one pair 
interchange has taken place. Thus, we may write 

E = 1 ± HMn + E fijfafv ± • • ■ ’ < 19 * 

P i<j i<j<k 

wher efij=f(ri — rj); again, note that the upper (lower) signs in this expansion pertain 
to a system of bosons (fermions). Now, the function fjj vanishes rapidly as the distance 
r t j becomes much larger than the mean thermal wavelength X. It then follows that if the 
mean interparticle distance, (V/N) 1 ^, in the system is much larger than the mean thermal 
wavelength, that is, if 


nX ' (2nmkT)y 2<1, (20) 

where n is the particle density in the system, then the sum i n (19) may t> e approx¬ 
imated by unity. Accordingly, the partition function of the system would become, see 
equation (17), 


Q N (V, 1 «*”'-> = ei) 

This is precisely the result obtained earlier for the classical ideal gas; see equation (3.5.9). 
Thus, we have obtained from our quantum-mechanical treatment the precise classical 
limit for the partition function Qn(V , T). Incidentally, we have achieved something more. 
First, we have automatically recovered here the Gibbs correction factor (1/Ad), which was 
introduced into the classical treatment on an ad hoc, semi-empirical basis. We, of course, 
tried to understand its origin in terms of the inherent indistinguishability of the parti¬ 
cles. Here, we see it coming in a very natural manner and its source indeed lies in the 
symmetrization of the wavefunctions of the system (which is ultimately related to the 
indistinguishability of the particles); compare to Problem 5.4. 

Second, we find here a formal justification for computing the number of microstates 
of a system corresponding to a given region of its phase space by dividing the volume of 
that region into cells of a “suitable” size and then counting instead the number of these 
cells. This correspondence becomes all the more transparent by noting that formula (21) 
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is exactly equivalent to the classical expression 


Q»0'.r ) = i t/ 


e -«P?+-+4)/2 m ( d 3N qd 3N p 


( 22 ) 


with coq — h 3N . Thirdly, in deriving the classical limit we have also evolved a criterion that 
enables us to determine whether a given physical system can be treated classically; math¬ 
ematically, this criterion is given by condition (20). Now, in statistical mechanical studies, 
a system that cannot be treated classically is said to be degenerate ; the quantity ru? may, 
therefore, be regarded as a degeneracy discriminant. Accordingly, the condition that clas¬ 
sical considerations may be applicable to a given physical system is that “the value of the 
degeneracy discriminant of the system be much less than unity.” 

Next, we note that, in the classical limit, the diagonal elements of the density matrix are 
given by 



(23) 


which is simply a product of N factors, each equal to (1 /V). Recalling that, for a single 
particle in a box of volume V, (r\p\r) = {\/V), see equation (5.3.14), we infer that in the 
classical limit there is no spatial correlation among the various particles of the system. In 
general, however, spatial correlations exist even if the particles are supposedly noninter¬ 
acting; these correlations arise from the symmetrization of the wavefunctions and their 
magnitude is quite significant if the interparticle distances in the system are comparable 
with the mean thermal wavelength of the particles. To see this more clearly, we consider 
the simplest relevant case, namely the one with N = 2. The sum J2p i s now exactly equal 
to 1 ± [/(ri 2 )] 2 . Accordingly, 



(24) 


and hence 




o 


OO 


(25) 




(26) 
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Combining (24) and (26), we obtain 

(ri,r 2 |/5|ri,r 2 }« ^[l±exp(-2Tcrl 2 /X 2 )}. (27) 

Thus, if ri 2 is comparable to k, the probability density (27) may differ considerably from 
the classical value (\/V) 2 . In particular, the probability density for a pair of bosons to 
be a distance r apart is larger than the classical, /'-independent value by a factor of 
[1 + exp(— 2nr 2 /k 2 )], which becomes as high as 2 as r->■ 0. The corresponding result for 
a pair of fermions is smaller than the classical value by a factor of [1 — exp (—2 nr 2 /k 2 )], 
which becomes as low as 0 as r -* 0. Thus, we obtain a positive spatial correlation among 
particles obeying Bose statistics and a negative spatial correlation among particles obeying 
Fermi statistics; see also Section 6.3. 

Another way of expressing correlations (among otherwise noninteracting particles) 
is by introducing a statistical interparticle potential v s (r) and then treating the particles 
classically (see Uhlenbeck and Gropper, 1932). The potential v s (r) must be such that the 
Boltzmann factor exp (— fiv s ) is precisely equal to the pair correlation function [...] in (27), 
that is, 


v s (r) = —kT In [l ± exp (— 2jrr 2 /A 2 )]. (28) 

Figure 5.1 shows a plot of the statistical potential v s (r) for a pair of bosons or fermions. 
In the Bose case, the potential is throughout attractive, thus giving rise to a “statistical 
attraction” among bosons; in the Fermi case, it is throughout repulsive, giving rise to a 
“statistical repulsion” among fermions. In either case, the potential vanishes rapidly as r 
becomes larger than k\ accordingly, its influence becomes less and less important as the 
temperature of the system rises. 



FIGURE 5.1 The statistical potential v s (r) between a pair of particles obeying Bose-Einstein statistics or Fermi-Dirac 
statistics. 
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Problems 


5.1. Evaluate the density matrix p mn of an electron spin in the representation that makes a x diagonal. 
Next, show that the value of (a z ), resulting from this representation, is precisely the same as the one 
obtained in Section 5.3. 

Hint: The representation needed here follows from the one used in Section 5.3 by carrying out a 
transformation with the help of the unitary operator 

(] = I 1/V2 W2\ 

1/V2 l/s/2 /' 

5.2. Prove that 


{q\e ^\q') = exp 




S(q-q'), 


where H(—ih d/dq, q) is the Hamiltonian operator of the system in the ^-representation, which 
formally operates on the Dirac delta function 5 (q — q'). Writing the S-function in a suitable form, 
apply this result to (i) a free particle and (ii) a linear harmonic oscillator. 

5.3. Derive the density matrix p for (i) a free particle and (ii) a linear harmonic oscillator in the 
momentum representation and study its main properties along the lines of Section 5.3. 

5.4. Study the density matrix and the partition function of a system of free particles, using the 
unsymmetrized wavefunction (5.4.3) instead of the symmetrized wavefunction (5.5.7). Show that, 
following this procedure, one encounters neither the Gibbs’ correction factor (1/A7!) nor a spatial 
correlation among the particles. 

5.5. Show that in the first approximation the partition function of a system of N noninteracting, 
indistinguishable particles is given by 


Qn(V,T) 


1 

N\k 3N 


Zn(V,T), 


where 


Zn(V,T) = 


J exp I -ft J2 Vs(r ‘j ) 

[ K i \ 


d 3N r, 


v s (r) being the statistical potential (5.5.28). Hence evaluate tht first-order correction to the equation 
of state of this system. 

5.6. Determine the values of the degeneracy discriminant (nk 3 ) for hydrogen, helium, and oxygen at 
NTP. Make an estimate of the respective temperature ranges where the magnitude of this quantity 
becomes comparable to unity and hence quantum effects become important. 

5.7. Show that the quantum-mechanical partition function of a system of N interacting particles 
approaches the classical form 


Qn(V ’ T)= niU’ / ^ W)rf3 V 3JV P 


as the mean thermal wavelength k becomes much smaller than (i) the mean interparticle distance 
(V/JV) 1 / 3 and (ii) a characteristic length ro of the interparticle potential. 11 

5.8. Prove the following theorem due to Peierls. 12 

“If H is the hermitian Hamiltonian operator of a given physical system and {</>„) an arbitrary 
orthonormal set of wavefunctions satisfying the symmetry requirements and the boundary 


u See Huang (1963, Section 10.2). 

12 See Peierls (1938) and Huang (1963, Section 10.3). 
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conditions of the problem, then the partition function of the system satisfies the following 
inequality: 

Q(fi)>J2 ex Pl-PWn\H\<Pn)}; 

n 

the equality holds when {</>„) constitute a complete orthonormal set of eigenfunctions of the 
Hamiltonian itself.” 




The Theory of Simple Gases 


We are now fully equipped with the formalism required for determining the macroscopic 
properties of a large variety of physical systems. In most cases, however, derivations run 
into serious mathematical difficulties, with the result that one is forced to restrict one’s 
analysis either to simpler kinds of systems or to simplified models of actual systems. In 
practice, even these restricted studies are carried out in a series of stages, the first stage 
of the process being highly “idealized.” The best example of such an idealization is the 
familiar ideal gas, a study of which is not only helpful in acquiring facility with the math¬ 
ematical procedures but also throws considerable light on the physical behavior of gases 
actually met with in nature. In fact, it also serves as a base on which the theory of real gases 
can be founded; see Chapter 10. 

In this chapter we propose to derive, and at some length discuss, the most basic pro¬ 
perties of simple gaseous systems obeying quantum statistics; the discussion will include 
some of the essential features of diatomic and polyatomic gases and chemical equilibrium. 


6.1 An ideal gas in a quantum-mechanical 
microcanonical ensemble 

We consider a gaseous system of N noninteracting, indistinguishable particles confined 
to a space of volume V and sharing a given energy E. The statistical quantity of interest 
in this case is C (N, V, E) which, by definition, denotes the number of distinct microstates 
accessible to the system under the macrostate (N, V,E). While determining this number, 
we must remember that a failure to take into account the indistinguishability of the parti¬ 
cles in a proper manner could lead to results which, except in the classical limit, may not 
be acceptable. With this in mind, we proceed as follows. 

Since, for large V, the single-particle energy levels in the system are very close to one 
another, we may divide the energy spectrum into a large number of “groups of levels,” 
which may be referred to as energy cells ; see Figure 6.1. Let e/ denote the average energy of 
a level, and gj the (arbitrary) number of levels, in the zth cell; we assume that all 1. In 
a particular situation, we may have m particles in the first cell, n-i particles in the second 
cell, and so on. Clearly, the distribution set {nd must conform to the conditions 

J2 n i = N CD 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00006-2 
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and 


Y riiBi = E. (2) 


Then 


n(N,V,E) = J2w{m}, (3) 

Gil 

where W{ni\ is the number of distinct microstates associated with the distribution set {///} 
while the primed summation goes over all distribution sets that conform to conditions (1) 
and (2) . Next, 


Wind = Y\w(i), 

i 


(4) 


where w(i) is the number of distinct microstates associated with the zth cell of the spectrum 
(the cell that contains m particles, to be accommodated among g, levels) while the product 
goes over all the cells in the spectrum. Clearly, w(i ) is the number of distinct ways in which 
the n L identical, and indistinguishable, particles can be distributed among the g, levels of 
the zth cell. This number, in the Bose-Einstein case, is given by, see equation (3.8.25), 


so that 


m.vJi) 


( m+gi-i )! 
nC-igi- 1 )! ’ 


Wb.e ,{»i! 


l F ( n i + Si ~ !)• 
M rijUgi- 1 )! ' 


(5) 


( 6 ) 
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In the Fermi-Dirac case, no single level can accommodate more than one particle; accord¬ 
ingly, the number //.,■ cannot exceed g;. The number w(i) is then given by the “number of 
ways in which the g, levels can be divided into two subgroups — one consisting of m levels 
(which will have one particle each) and the other consisting of (gi - rii ) levels (which will 
be unoccupied).’’ This number is given by 


W'F.D.(i) 


gi'- 

n i- (gi ~ n i)'- ’ 


(7) 


so that 


Wf.d .{«/} 


n 


gv- 


n-i'- (gi - ni)\ 


( 8 ) 


For completeness, we may include the classical — or what is generally known as the 
Maxwell-Boltzmann — case as well. There, the particles are regarded as distinguishable, 
with the result that any of the n, particles may be put into any of the g,- levels, inde¬ 
pendently of one another, and the resulting states may all be regarded as distinct; the 
number of these states is clearly (g,)”' : . Moreover, the distribution set {«,-} in this case is 
itself regarded as obtainable in 


n\\n,2\... 

different ways which, on the introduction of the Gibbs correction factor, lead to a “weight 
factor” of 


1 

ni'.n 2 \... 



( 10 ) 


see also Section 1.6, especially equation (1.6.2). Combining these two results, we obtain 

igi) n ‘ 


Wm.b.{«/} = ]""[ - 


nil 


( 11 ) 


Now, the entropy of the system would be given by 


S(N, V,E) = klnQ (N, V, E) = 



( 12 ) 


It can be shown that, under the conditions of our analysis, the logarithm of the sum on 
the right side of (12) can be approximated by the logarithm of the largest term in the sum; 
see Problem 3.4. We may, therefore, replace (12) by 


S(N, V,E)^klnW{n* i }, 


( 13 ) 
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where {n*} is the distribution set that maximizes the number W{rii\; the numbers n* are 
clearly the most probable values of the distribution numbers n t . The maximization, how¬ 
ever, is to be carried out under the restrictions that the quantities N and E remain constant. 
This can be done by the method of Lagrange’s undetermined multipliers; see Section 3.2. 
Our condition for determining the most probable distribution set {n*} now turns out to be, 
see equations (1), (2), and (13), 


<5 In W{rii) 


a'^2Sn i + p^eiSrii 
i i 


= 0 . 


(14) 


For In W{n.j}, we obtain from equations (6), (8), and (11), assuming that not only all g, but 
also all Hi 1 (so that the Stirling approximation ln(x!) ~ x\nx — x can be applied to all the 
factorials that appear in these expressions), 


In W{iii} = y~dn w{i) 
i 


mini — — a 


gi i 


In ( 1 — a 


gi 


(15) 


where a — — 1 for the B.E. case, +1 for the F.D. case, and 0 for the M.B. case. Equation (14) 
then becomes 


E 


In ^ — — a'j — a — ft ei 


Sn : = 0. 


(16) 


In view of the arbitrariness of the increments <5n z - in (16), we must have (for all i ) 



— a — ftsi = 0 , 


(17) 


so that 1 


gi 


gU+fiSi _|_ a 


(18) 


The fact that n* turns out to be directly proportional to g, prompts us to interpret the 
quantity 


gi e a+ ^ e i + a ’ 


(18a) 


which is actually the most probable number of particles per energy level in the zth cell, as 
the most probable number of particles in a single level of energy e,-. Incidentally, our final 
result (18a) is totally independent of the manner in which the energy levels of the particles 


For a critique of this derivation, see Landsberg (1954a, 1961). 
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are grouped into cells, so long as the number of levels in each cell is sufficiently large. As 
shown in Section 6.2, formula (18a) can also be derived without grouping energy levels 
into cells at all; in fact, it is only then that this result becomes truly acceptable! 
Substituting (18) into (15), we obtain for the entropy of the gas 

= \ n *i (“ + P s i) + - In {1 + ae-“- ps t }]. (19) 

i 

The first sum on the right side of (19) is identically equal to aN while the second sum is 
identically equal to (3E. For the third sum, therefore, we have 

^ J2 gi In {1 + ae- 01 -^ J = - — aN — pE. (20) 

i 

Now, the physical interpretation of the parameters a and ft here is going to be precisely the 
same as in Section 4.3, namely 


M 1 

a = ~W P = kT’ 

for confirmation see Section 6.2. The right side of equation (20) is, therefore, equal to 

S | nN E G — (E— TS ) _ PV 
k + ~kf~kf~ kT “ IcT' 

The thermodynamic pressure of the system is, therefore, given by 


PV=^ 

a 


£[gdn{l + «e-“-^j]. 


( 21 ) 


( 22 ) 


(23) 


In the Maxwell-Boltzmann case (a -> 0), equation (23) takes the form 

PV = kT^gie-*-^ = krJ2 n * = NkT ’ 


(24) 


which is the familiar equation of state of the classical ideal gas. Note that equation (24) for 
the Maxwell-Boltzmann case holds irrespective of the details of the energy spectrum e,. 

It will be recognized that the expression a~ l £T[ ] in equation (23), being equal to the 
thermodynamic quantity (PV/kT), ought to be identical to the q-potential of the ideal gas. 
One may, therefore, expect to obtain from this expression all the macroscopic properties of 
this system. However, before demonstrating this, we would like to first develop the formal 
theory of an ideal gas in the canonical and grand canonical ensembles. 
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6.2 An ideal gas in other quantum-mechanical 
ensembles 

In the canonical ensemble the thermodynamics of a given system is derived from its 
partition function 


Qjv (V,T) = J2 e ~ PE ’ (U 

E 

where E denotes the energy eigenvalues of the system while /3 = 1/kT. Now, an energy 
value E can be expressed in terms of the single-particle energies e; for instance, 

E='Y^n E s, ( 2 ) 

8 


where n f: is the number of particles in the single-particle energy state e. The values of the 
numbers n E must satisfy the condition 


J2 n *= N - (3) 


Equation (1) may then be written as 



-PY. n eC 

E 


(4) 


where g{n E } is the statistical weight factor appropriate to the distribution set { n !: } and the 
summation goes over all distribution sets that conform to the restrictive condition (3). 
The statistical weight factor in different cases is given by 


gB.E.{H E ) = 1. 


(5) 


gF.D.lHd 


11 if all n E = 0 or 1 
10 otherwise, 


( 6 ) 


and 


gM.B.bZs} = n —p (7) 

e s ' 

Note that in the present treatment we are dealing with single-particle states as individ¬ 
ual states, without requiring them to be grouped into cells; indeed, the weight factors (5), 
(6), and (7) follow straightforwardly from their respective predecessors (6.1.6), (6.1.8), and 
(6.1.11) by putting all g L = 1. 
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First of all, we work out the Maxwell-Boltzmann case. Substituting (7) into (4), we get 


Qn(V,T) = J2 
t n e } 


n(-T 


= -E' 

N'-m 




( 8 ) 


Since the summation here is governed by condition (3), it can be evaluated with the help 
of the multinomial theorem, with the result 


Qn(V,T) = 


1 

m 



t 

Nl 


[Qi(V, T)] n , 


(9) 


in agreement with equation (3.5.15). The evaluation of Qi is, of course, straightforward. 
One obtains, using the asymptotic formula (2.4.7) for the number of single-particle states 
with energies lying between e and s + de, 


Qi(V,T) = ^2 e ^ (2m) 3/l2 J e ^ e e l ^ 2 de 

e o 

= v/x 3 , (10) 

where X [— h/(2TtmkT )h 2 ] is the mean thermal wavelength of the particles. Hence 

V N 

Qn(V, T ) = (11) 

from which complete thermodynamics of this system can be derived; see, for example, 
Section 3.5. Further, we obtain for the grand partition function of this system 

OO 

a(z, V,T)=J2 z N Qn(V, T) = exp(zVA 3 ); (12) 

N =0 

compare to equation (4.4.3). We know that the thermodynamics of the system follows 
equally well from the expression for GL. 

In the Bose-Einstein and Fermi-Dirac cases, we obtain, by substituting (5) and (6) 
into (4), 


Qn(V. 


, T) = J2' 


{n e } 


-PT. n ee 


(13) 
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the difference between the two cases, B.E. and F.D., arises from the values that the 
numbers n E can take. Now, in view of restriction (3] on the summation J2'> an explicit 
evaluation of the partition function Qn in these cases is rather cumbersome. The grand 
partition function <3., on the other hand, turns out to be more easily tractable; we have 


&(z, V, 


T)=Z 


N= 0 


>E' 




-PY.n e e~ 
e e 


(14a) 


= E 

N= 0 

Now, the double summation in (14b) — first over the numbers n E constrained by a fixed 
value of the total number N, and then over all possible values of N — is equivalent to 
a summation over all possible values of the numbers n E , independently of one another. 
Hence, we may write 


E'n (*-")' 

I'M V V ’ 


(14b) 


<2(z, V, T) 



no,ni,... 


EK'"")"’ 

no 

i 

sT 

1 

In? 

_1 


(15) 


Now, in the Bose-Einstein case the n E can be either 0 or 1 or 2 or ..., while in the Fermi- 
Dirac case they can be only 0 or 1. Therefore, 


&(z, V, T) = 


n___ 

(1 - ze~P £ ) 
Y\0- + ze~P e ) 


in the B.E. case, with ze < 1 
in the F.D. case. 


(16a) 

(16b) 


The q-potential of the system is thus given by 


q(z, V, T) 


PV 

kf 


In (3(z, V, T) 


= ^^ln(l^ze ^ £ ); 

E 


(17) 


compare to equation (6.1.23), with g,- = 1. The identification of the fugacity z with the 
quantity e~ a of equation (6.1.23) is quite natural; accordingly, a = —fi/kT. As usual, the 
upper (lower) sign in equation (17) corresponds to the Bose (Fermi) case. 

In the end, we may write our results for q in a form applicable to all three cases: 

PV 1 

q(z, V, T) = — = - V ln(l + aze “( ie ), 

1C 1 & 


( 18 ) 
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where a — — 1, +1, or 0, depending on the statistics governing the system. In particular, the 
classical case (a 0) gives 


qM.B.=zJ2 e ^ = zQ\, 

e 

in agreement with equation (4.4.4). From (18), it follows that 

n=z(^) — 

\dzjyj z-j z-i e P £ + a 

and 


V3 PJz,v 


z -i e /fs -i- a' 


(19) 


( 20 ) 


( 21 ) 


At the same time, the mean occupation number {n E ) of level e turns out to be, see equa¬ 
tions (14a) and (17), 


{n e ) = 


1 

1 


a 

P 

V ds ) Z) T } all others 


ft \ 9e / z ,T, all other s 

1 

z~ l eP E + a’ 


( 22 ) 


in keeping with equations (20) and (21). Comparing our final result (22) with its coun¬ 
terpart (6.1.18a), we find that the mean value (n) and the most probable value n* of the 
occupation number n of a single-particle state are indeed identical. 


6.3 Statistics of the occupation numbers 

Equation (6.2.22) gives the mean occupation number of a single-particle state with energy 
e as an explicit function of the quantity (e - p)/kT: 

{ne) = e (s-^)/kT + a - (1) 

The functional behavior of this number is shown in Figure 6.2. In the Fermi-Dirac case 
(a = + 1), the mean occupation number never exceeds unity, for the variable n e itself 
cannot have a value other than 0 or 1. Moreover, for e < p and | e — p | » kT, the mean occu¬ 
pation number tends to its maximum possible value 1. In the Bose-Einstein case (a = —1), 
we must have p < all e; see equation (6.2.16a). In fact, when p becomes equal to the low¬ 
est value of e, say so, the occupancy of that particular level becomes infinitely high, which 
leads to the phenomenon of Bose-Einstein condensation-, see Sections 7.1 and 7.2. For 
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FIGURE 6.2 The mean occupation number {n B ) of a single-particle energy state e in a system of noninteracting 
particles: curve 1 is for fermions, curve 2 for bosons, and curve 3 for the Maxwell-Boltzmann particles. 


p < so, all values of (s — p) are positive and the behavior of all (n e ) is nonsingular. Finally, 
in the Maxwell-Boltzmann case (a — 0), the mean occupation number takes the familiar 
form 


<« e )m.b. = expjO - s)/kT } oc exp (-e/kT). (2) 

The important thing to note here is that the distinction between the quantum statistics 
(a = ±1) and the classical statistics (a = 0) becomes imperceptible when, for all values of e 
that are of practical interest, 


exp{(e — p)/kT] » 1. (3) 

In that event, equation (1) essentially reduces to (2) and we may write, instead of (3), 

{n E ) « 1. (4) 

Condition (4) is quite understandable, for it implies that the probability of any of the n e 
being greater than unity is quite negligible, with the result that the classical weight factors 
g{n E ], as given by equation (6.2.7), become essentially equal to 1. The distinction between 
the classical treatment and the quantum-mechanical treatment then becomes rather 
insignificant. Correspondingly, we find, see Figure 6.2, that for large values of (e - p)/kT 
the quantum curves 1 and 2 essentially merge into the classical curve 3. Since we already 
know that the higher the temperature of the system the better the validity of the classical 
treatment, condition (3) also implies that p, the chemical potential of the system, must 
be negative and large in magnitude. This means that the fugacity z[= exp (p/kT)] of the 
system must be much smaller than unity; see also equation (6.2.22). One can see, from 
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equations (4.4.6) and (4.4.29), that this is further equivalent to the requirement 

NX 3 


(5) 


which agrees with condition (5.5.20). 

We shall now examine statistical fluctuations in the variable n B . Going a step further 
from the calculation that led to equation (6.2.22), we have 




2 

a 


z,T, all others 


( 6 ) 


it follows that 


(n 3 ) - {n E ) 2 



2 

In® 


z,T, all others 



(7) 


For the relative mean-square fluctuation, we obtain (irrespective of the statistics obeyed 
by the particles) 





( 8 ) 


of course, the actual value of this quantity will depend on the statistics of the particles 
because, for a given particle density (N/V) and a given temperature T, the value of z will 
be different for different statistics. 

It seems more instructive to write (8) in the form 


In the classical case {a — 0), the relative fluctuation is normal. In the Fermi-Dirac case, it 
is given by 1 /(n e ) — 1, which is below normal and tends to vanish as (n e ) -*■ 1. In the Bose- 
Einstein case, the fluctuation is clearly above normal. 2 Obviously, this result would apply 
to a gas of photons and, hence, to the oscillator states in the black-body radiation. In the 
latter context, Einstein derived this result as early as 1909 following Planck’s approach and 
even pointed out that the term 1 in the expression for the fluctuation may be attributed 
to the wave character of the radiation and the term 1 /(n e ) to the particle character of the 
photons; for details, see Kittel (1958), ter Haar (1968). 

Closely related to the subject of fluctuations is the problem of “statistical correlations 
in photon beams,” which have been observed experimentally (see Hanbury Brown and 


2 The special case of fluctuations in the ground state occupation number, no, of a Bose-Einstein system has been 
discussed by Wergeland (1969) and by Fujiwara, ter Haar, and Wergeland (1970). 
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Twiss, 1956, 1957, 1958) and have been explained theoretically in terms of the quantum- 
statistical nature of these fluctuations (see Purcell, 1956; Kothari and Auluck, 1957). For 
further details, refer to Mandel, Sudarshan, and Wolf (1964); and Holliday and Sage (1964). 

For greater understanding of the statistics of the occupation numbers, we evaluate 
the quantity p E (n), which is the probability that there are exactly n particles in a state of 
energy e. Referring to equation (6.2.14b), we infer that p E (n) cx {ze~^ E ) n . On normalization, 
it becomes in the Bose-Einstein case 

Psin) Ib.e. = ( ze-P [l -ze _/j£ ] 

= (J*U) n ^— = «*»" cm) 

V (« s > + 1 / (n E ) + 1 ((n e ) + 1)” +1 


In the Fermi-Dirac case, we get 

Ps(n) If.d. = ( ze~P + 

{ 1 — (n B ) for n = 0 

. . f , (ID 

(n B ) for n=\. 

In the Maxwell-Boltzmann case, we have p E (n) oc (ze~^ E ) n /n\ instead; see equation (6.2.8). 
On normalization, we get 


Vein) Im.b. 


(ze P s ) n /n\ 
exp ( ze~P E ) 


« n °» n c -(n e ) 

n\ 


( 12 ) 


Distribution (12) is clearly a Poisson distribution, for which the mean square deviation of 
the variable in question is equal to the mean value itself; compare to equation (9), with 
a — 0. It also resembles the distribution of the total particle number N in a grand canonical 
ensemble consisting of ideal, classical systems; see Problem 4.4. We also note that the ratio 
p E (n)/p E (n — 1) in this case varies inversely with n, which is a “normal” statistical behavior 
of uncorrelated events. 

On the other hand, the distribution in the Bose-Einstein case is geometric, with a con¬ 
stant common ratio (n E )/({n E ) + 1). This means that the probability of a state e acquiring 
one more particle for itself is independent of the number of particles already occupying 
that state; thus, in comparison with the “normal” statistical behavior, bosons exhibit a spe¬ 
cial tendency of “bunching” together, which means a positive statistical correlation among 
them. In contrast, fermions exhibit a negative statistical correlation. 


6.4 Kinetic considerations 

The thermodynamic pressure of an ideal gas is given by equation (6.1.23) or (6.2.18). In 
view of the largeness of volume V, the single-particle energy states e would be so close 
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to one another that a summation over them may be replaced by integration. One thereby 
gets 


a 


oo 

M 


l+aze-fV> 

J h 6 


AnkT 
ah 3 




aze ^ E(p ) ds 
+ aze-^W ^ dp P 


The integrated part vanishes at both limits while the rest of the expression reduces to 

OO 

de' 


P = 


An 

w 


j z~ 1 e^ B( 


( P> + a\ dp 


P~f- P d P- 


(1) 


Now, the total number of particles in the system is given by 


N = 


[ Vd 3 p AnV f 

J = ~W J 


1 


h 3 h 3 J z~ l e^ s{ JP+a 


p 2 dp. 


( 2 ) 


Comparing (1) and (2), we can write 


3 V 



1 

-n(pu), 


(3) 


where n is the particle density in the gas and u the speed of an individual particle. If the 
relationship between the energy e and the momentum p is of the form s oc p s , then 


P = 


s s E 

-n{s) =-; 

3 3 V 


(4) 


the particular cases 5=1 and 5 = 2 are pretty easy to recognize. It should be noted here 
that results (3) and (4) hold independently of the statistics obeyed by the particles. 

The structure of formula (3) suggests that the pressure of the gas arises essentially from 
the physical motion of the particles; it should, therefore, be derivable from kinetic consid¬ 
erations alone. To do this, we consider the bombardment, by the particles of the gas, on 
the walls of the container. Let us take, for example, an element of area dA on one of the 
walls normal to the z-axis, see Figure 6.3, and focus our attention on those particles whose 
velocity lies between u and u + du\ the number of such particles per unit volume may be 
denoted by nf{u)du, where 


f(u)du = 1 . 


( 5 ) 
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-► Z 

(udt) 




FIGURE 6.3 The molecular bombardment on one of the walls of the container. 

Now, the question is: how many of these particles will strike the area dA in time dtl The 
answer is: all those particles that happen to lie in a cylindrical region of base dA and height 
udt, as shown in Figure 6.3. Since the volume of this region is (dA ■ u)dt, the number of 
such particles would be {(dA - u)dt x nf(u)du}. On reflection from the wall, the normal 
component of the momentum of a particle would undergo a change from p z to — p z , as 
a result, the normal momentum imparted by these particles per unit time to a unit area 
of the wall would be 2 p z {u z nf(u)du}. Integrating this expression over all relevant u, we 
obtain the total normal momentum imparted per unit time to a unit area of the wall by all 
the particles of the gas which, by definition, is the kinetic pressure of the gas: 


OO OO OO 


P = 2n 


/ / / p z u z f(u)du x duydu z . 


( 6) 3 


u x =-oo Uy=-oo U z =0 


Since (i) f(u) is a function of u alone and (ii) the product ( p z u z ) is an even function of u z , 
the foregoing result may be written as 


P=n j ( p z Uz)f(u)du. 


(7) 


all u 


Comparing (7) with (5), we obtain 


P = n{p z u z ) = n(puco$r 9) 


= 3 n (pu), 


( 8 ) 

(9) 


which is identical to (3). 


3 Clearly, only those velocities for which u z > 0 are relevant here. 
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In a similar manner, we can determine the rate of effusion of the gas particles through 
a hole (of unit area) in the wall. This is given by, compared to (6), 


OO OO OO 


'J-xdllydllz 


-Iff u z f(u)du x 

u x =-o o u y =-o O u z =0 
2: r 7i / 2 oo 

-III {ucos8f(u)}(u 2 sin0 dudd d<j >); 


( 10 ) 


( 11 ) 


0=0 6=0 «=0 


note that the condition u z > 0 restricts the range of the angle 6 between the values 0 and 
7r/2. Carrying out integrations over 0 and cp, we obtain 


R= nn J f(u)u 3 


du. 


( 12 ) 


In view of the fact that 


(JXJ 

J f(u)(4nu 2 du) = 1, 


(5a) 


equation (12) maybe written as 


R=^n(u). (13) 

Again, this result holds independently of the statistics obeyed by the particles. 

It is obvious that the velocity distribution among the effused particles is considerably 
different from the one among the particles inside the container. This is due to the fact that, 
firstly, the velocity component u z of the effused particles must be positive (which intro¬ 
duces an element of anisotropy into the distribution) and, secondly, the particles with 
larger values of u z appear with an extra weightage, the weightage being directly propor¬ 
tional to the value of u z ; see equation (10). As a result of this, (i) the effused particles carry 
with them a net forward momentum, thus causing the container to experience a recoil 
force, and (ii) they carry away a relatively large amount of energy per particle, thus leaving 
the gas in the container at not only a progressively decreasing pressure and density but 
also a progressively decreasing temperature; see Problem 6.14. 


6.5 Gaseous systems composed of molecules 
with internal motion 

In most of our studies so far we have considered only the translational part of the molecu¬ 
lar motion. Though this aspect of motion is invariably present in a gaseous system, other 
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aspects, which are essentially concerned with the internal motion of the molecules, also 
exist. It is only natural that in the calculation of the physical properties of such a sys¬ 
tem, contributions arising from these motions are also taken into account. In doing so, 
we shall assume that (i) the effects of the intermolecular interactions are negligible and (ii) 
the nondegeneracy criterion 


q fill 

UX = (2nmkT) 3 / 2 <<C (5 - 5 ' 20) 

is fulfilled; this makes our system an ideal, Boltzmannian gas. Under these assumptions, 
which hold sufficiently well in a large number of applications, the partition function of the 
system is given by 


Qn(V,T) = 


1 


[Qi(V, T)] n , 


( 1 ) 


where 


Qi(V,T) = (2) 

the factor within the curly brackets is the familiar translational partition function of a 
molecule, while j(T) is the partition function corresponding to internal motions. The latter 
may be written as 


]\T) = J2Si e ~ Si/kT ’ ® 

i 

where e ; is the energy associated with a state of internal motion (characterized by the 
quantum numbers i ), while g, is the multiplicity of that state. 

The contributions made by the internal motions of the molecules, over and above 
the translational degrees of freedom, follow straightforwardly from the function j(T). We 
obtain 


h int = —NkTlnj, (4) 

Mint = —kTlnj, (5) 

S int = Attain;+ 7^ 111/), (6) 

U int = NkT 2 ^-In j, (7) 

and 

(Cv) tat = M^{l*Aln/j. (8) 
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Thus, the central problem in this study is to derive an explicit expression for the function 
j(T) from a knowledge of the internal states of the molecules. For this, we note that the 
internal state of a molecule is determined by (i) the electronic state, (ii) the state of the 
nuclei, (iii) the vibrational state, and (iv) the rotational state. Rigorously speaking, these 
four modes of excitation mutually interact; in many cases, however, they can be treated 
independently of one another. We can then write 

j(T) = 7e I ec 1T)j nu c (T)7vi b (T)f rot (T), (3a) 

with the result that the net contribution made by the internal motions to the various 
thermodynamic properties of the system is given by a simple sum of the four respective 
contributions. There is one interaction, however, that plays a special role in the case of 
homo nuclear molecules, such as AA, and which is between the states of the nuclei and the 
rotational states. In such a case, we better write 


j(T) — f e I ec (T)f n u c—ro t( T)j' v i t, ( T). (3b) 

We now examine this problem for various systems in the order of increasing complexity. 

6.5.A Monatomic molecules 

For simplicity, we consider a monatomic gas at temperatures such that the thermal energy 
kT is small in comparison with the ionization energy £i 0n ; for different atoms, this amounts 
to the condition T Ei on /k ~ 10 4 — 10 5 K. At these temperatures, the number of ionized 
atoms in the gas would be insignificant. The same would be true of atoms in the excited 
states, for the separation of any of the excited states from the ground state of the atom 
is generally of the same order of magnitude as the ionization energy itself. Thus, we may 
regard all atoms in the gas to be in their (electronic) ground state. 

Now, there is a special class of atoms, namely He, Ne, A,..., which, in their ground state, 
possess neither orbital angular momentum nor spin (L = S — 0). Their (electronic) ground 
state is clearly a singlet, with g e = 1. The nucleus, however, possesses a degeneracy that 
arises from the possibility of different orientations of the nuclear spin. 4 If the value of 
this spin is S n , the corresponding degeneracy factor g n = 2S n + 1. Moreover, a monatomic 
molecule cannot have any vibrational or rotational states. The internal partition function 
(3a) of such a molecule is, therefore, given by 

j(T) (y)gr.sL = ge ' gn — 2 S n + 1. (9) 

4 As is well known, the presence of the nuclear spin gives rise to the so-called hyperflne structure in the electronic 
states. However, the intervals of this structure are such that, for practically all temperatures of interest, they are small in 
comparison with kT ; for concreteness, these intervals correspond to T -values of the order of 10 _1 to 10° K. Accordingly, 
in the evaluation of the partition function j(T), the hyperfine splitting of the electronic state may be disregarded while 
the multiplicity introduced by the nuclear spin may be taken into account through a degeneracy factor. 
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Equations (4) through (8) then tell us that the internal motions in this case contribute only 
toward properties such as the chemical potential and the entropy of the gas; they do not 
contribute toward the internal energy and the specific heat. 

If, on the other hand, the ground state does not possess orbital angular momentum but 
possesses spin (L = 0, S / 0 — as, for example, in the case of alkali atoms), then the ground 
state will still have no fine structure; it will, however, have a degeneracy g e — 2S + 1. As a 
result, the internal partition function j(T) will get multiplied by a factor of (2S + 1) and 
the properties such as the chemical potential and the entropy of the gas will get modified 
accordingly. 

In other cases, the ground state of the atom may possess both orbital angular momen¬ 
tum and spin (L / 0, S ^ 0); the ground state would then possess a definite fine structure. 
The intervals of this structure are, in general, comparable to kT ; hence, in the evaluation 
of the partition function, the energies of the various components of the fine structure will 
have to be taken into account. Since these components differ from one another in the value 
of the total angular momentum /, the relevant partition function may be written as 

Mec(T) = J2W +1)e ~ ei/kT - (10) 

/ 

The foregoing expression simplifies considerably in the following limiting cases: 

(a) kT » all ej ; then 

jelec(T) ~ X/2-/+ 1) = (21+ 1)(2S+ 1). (10a) 

I 

(b) kT « all sj) then 

jelec(T) — (2/o + \)e~ E ° /kT , (10b) 

where J 0 is the total angular momentum, and e 0 the energy, of the atom in the lowest state. 
In either case, the electronic motion makes no contribution toward the specific heat of 
the gas. Of course, at intermediate temperatures, we do obtain a contribution toward this 
property. And, in view of the fact that both at high and low temperatures the specific heat 
tends to be equal to the translational value | Nk, it must pass through a maximum at a 
temperature comparable to the separation of the fine structure levels. 5 Needless to say, the 
multiplicity (2 S n + 1) introduced by the nuclear spin must be taken into account in each 
case. 

6.5.B Diatomic molecules 

Now we consider a diatomic gas at temperatures such that kT is small compared to the 
energy of dissociation; for different molecules, this amounts once again to the condition 

5 It seems worthwhile to note here that the values of Aej/k for the components of the normal triplet term of oxygen 
are 230 K and 320 K, while those for the normal quintuplet term of iron range from 600 to 1,400 K. 
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T <£ s^iss/lc ~ 10 4 —10 5 K. At these temperatures the number of dissociated molecules in 
the gas would be insignificant. At the same time, in most cases, there would be practi¬ 
cally no molecules in the excited states as well, for the separation of any of these states 
from the ground state of the molecule is in general comparable to the dissociation energy 
itself. 6 Accordingly, in the evaluation of j(T), we have to take into account only the lowest 
electronic state of the molecule. 

The lowest electronic state, in most cases, is nondegenerate: g e — 1. We then need not 
consider any further the question of the electronic state making a contribution toward 
the thermodynamic properties of the gas. However, certain molecules (though not very 
many) have, in their lowest electronic state, either (i) a nonzero orbital angular momentum 
(A / 0) or (ii) a nonzero spin (S / 0) or (iii) both. In case (i), the electronic state acquires a 
twofold degeneracy corresponding to the two possible orientations of the oribital angular 
momentum relative to the molecular axis; 7 as a result, g e = 2. In case (ii), the state acquires 
a degeneracy 2S + 1 corresponding to the space quantization of the spin. 8 

In both these cases the chemical potential and the entropy of the gas are modified by 
the multiplicity of the electronic state, while the energy and the specific heat remain unaf¬ 
fected. In case (iii), we encounter a fine structure that necessitates a rather detailed study 
because the intervals of this structure are generally of the same order of magnitude as JcT. 
In particular, for a doublet fine-structure term, such as the one that arises in the molecule 
NO (rii/ 2 , 3/2 with a separation of 178 K, the components themselves being A-doublets), 
we have for the electronic partition function 

;'elec(T) = gO + gie _A/fc:r , (11) 

where go and gi are the degeneracy factors of the two components while A is their separa¬ 
tion energy. The contribution made by (11) toward the various thermodynamic properties 
of the gas can be calculated with the help of formulae (4) through (8). In particular, we 
obtain for the contribution toward the specific heat 

, r , _ (A/kT) 2 _ 

(WW Nk (i + (g o/ g l)e A/fcr] (1 + (g 1 /g o)e -A/^ ] • 

We note that this contribution vanishes both for T <$; A/k and for T » A jk and is maxi¬ 
mum for a certain temperature ~ A //c; compare to the corresponding situation in the case 
of monatomic molecules. 


6 An odd case arises with oxygen. The separation between its normal term 3 E and the first excited term 1 A is about 
11,250 K, whereas the dissociation energy is about 55,000 K. The relevant factor e~ sl ^ kT , therefore, can be quite significant 
even when the factor e~ e ^snl lcT j s no t, say for T ~ 2000 to 6000 K. 

7 Strictly speaking, the term in question splits into two levels — the so-called A-doublet. The separation of the levels, 
however, is such that we can safely neglect it. 

8 The separation of the resulting levels is again negligible from the thermodynamic point of view; as an example, one 
may cite the very narrow triplet term of O 2 . 
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We now consider the effect of the vibrational states of the molecules on the thermo¬ 
dynamic properties of the gas. To have an idea of the temperature range over which this 
effect would be significant, we note that the magnitude of the corresponding quantum of 
energy, namely hco, for different diatomic gases is of order 10 3 K. Thus, we would obtain full 
contributions (consistent with the dictates of the equipartition theorem) at temperatures 
of the order of 10 4 K or more, and practically no contribution at temperatures of the order 
of 10 2 K or less. Let us assume that the temperature is not high enough to excite vibra¬ 
tional states of large energy; the oscillations of the nuclei then remain small in amplitude 
and hence harmonic. The energy levels for a mode of frequency co are then given by the 
well-known expression (n + 

The evaluation of the vibrational partition function j^iT) is quite elementary; see 
Section 3.8. In view of the rapid convergence of the series involved, the summation may 
formally be extended to n — oo. The corresponding contributions toward the various ther¬ 
modynamic properties of the system are then given by equations (3.8.16) through (3.8.21). 
In particular, 



(13) 


We note that for T » & v , the vibrational specific heat is very nearly equal to the equipar¬ 
tition value Nk; otherwise, it is always less than Nk. In particular, for T @ v> the specific 
heat tends to zero (see Figure 6.4); the vibrational degrees of freedom are then said to be 
“frozen.” 

At sufficiently high temperatures, when vibrations with large n are also excited, the 
effects of anharmonicity and of interaction between the vibrational and the rotational 
modes of the molecule can become important. 9 10 However, since this happens only at large 
n, the relevant corrections to the various thermodynamic quantities can be determined 
even classically; see Problems 3.29 and 3.30. One finds that the first-order correction to 
C vib is directly proportional to the temperature of the gas. 

Finally, we consider the effect of (i) the states of the nuclei and (ii) the rotational states 
of the molecule; wherever necessary, we shall take into account the mutual interaction of 
these modes. This interaction is of no relevance in the case of heteronuclear molecules, 
such as AB; it is, however, important in the case of homonuclear molecules, such as AA. We 
may, therefore, consider the two cases separately. 

The states of the nuclei in the heteronuclear case may be treated separately from 
the rotational states of the molecule. Proceeding in the same manner as for monatomic 
molecules, we conclude that the effect of the nuclear states is adequately taken care of 

9 It may be pointed out that the vibrational motion of a molecule is influenced by the centrifugal force arising from 
the molecular rotation. This leads to an interaction between the rotational and the vibrational modes. However, unless 
the temperature is too high, this interaction can be neglected and the two modes treated independently of one another. 

10 In principle, these two effects are of the same order of magnitude. 
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FIGURE 6.4 The vibrational specific heat of a gas of diatomic molecules. At T — ©„, the specific heat is already 
about 93 percent of the equipartition value. 


through a degeneracy factor g n . Denoting the spins of the two nuclei by S A and Sb, 

g n = (2S A +l)(2S B +l). (14) 

As before, we obtain a finite contribution toward the chemical potential and the entropy 
of the gas but none toward the internal energy and the specific heat. 

Now, the rotational levels of a linear “rigid” rotator, with two degrees of freedom (for 
the axis of rotation) and the principal moments of inertia (1,1, 0), are given by 

e rot = 1(1+ l)h 2 /2I, 1= 0,1,2,...; (15) 

here, / = /ir, 2 where // [= ni] m 2 /(mi + m 2 )] is the reduced mass of the nuclei and ro the equi¬ 
librium distance between them. The rotational partition function of the molecule is then 
given by 


jrot(G> = ^(2Z+l)exp 
1=0 

00 I 0 1 h 2 

= £(2Z+l)exp|-Z(Z+l)-M; 0 r =^r. (16) 

1=0 * ' 

The values of © r , for all gases except the ones involving the isotopes H and D, are much 
smaller than room temperature. For example, the value of 0 r for HC1 is about 15 K, for N 2 , 
O 2 , and NO it lies between 2 K and 3 K, while for CI 2 it is about one-third of a degree. On 
the other hand, the values of ©,■ for H 2 , D 2 , and HD are, respectively, 85 K, 43 K, and 64 K. 


— 1 ( 1 + 1 ) 


h 2 

2IkT 
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These numbers give us an idea of the respective temperature ranges in which the effects 
arising from the discreteness of the rotational states are expected to be important. 

For T » © r , the spectrum of the rotational states may be approximated by a contin¬ 
uum. The summation in (16) is then replaced by an integration: 

OO 

jrot(T) « J(2Z+l)expj —/(/+l)^|rf/= (17) 

o 

The rotational specific heat is then given by 


(Cy)rot — (Vfc, 


(18) 


consistent with the equipartition theorem. 

A better evaluation of the sum in (16) can be made with the help of the Euler-Maclaurin 
formula, namely 


“rill 

J2f (n) = / f(x) dx + 2f (0) - T^/ (0) + 720^ " (0) “ 

n= 0 n 


1 


30,240 


/ v (0) + - 


(19) 


Writing 


f(x) = (2x+ l)exp(-x(x+ l)@ r /T), 


one obtains 


7 rot CO = 


T 1 1 0 r 4 

0^ + 3 + 15T + 3l5 



2 


( 20 ) 


which is the so-called Mulholland’s formula; as expected, the main term of this formula is 
identical to the classical partition function (17). The corresponding result for the specific 
heat is 


(Cv)rot — Nk\ 1 + — 




( 21 ) 


which shows that at high temperatures the rotational specific heat decreases with temper¬ 
ature and ultimately tends to the classical value Nk. Thus, at high (but finite) temperatures 
the rotational specific heat of a diatomic gas is greater than the classical value. On the other 
hand, it must go to zero as T -* 0. We, therefore, conclude that it passes through at least 
one maximum. Numerical studies show that there is only one maximum that appears at a 
temperature of about 0.80, and has a value of about 1.1A1A:; see Figure 6.5. 
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FIGURE 6.5 The rotational specific heat of a gas of heteronuclear diatomic molecules. 

In the other limiting case, when T © r , one may retain only the first few terms of the 
sum in (16); then 


trot CO = 1 + 3e~ 2&r ' T + 5 g _60 r/T + ... 

from which one obtains, in the lowest approximation, 


( 22 ) 



(23) 


Thus, as T -*■ 0, the specific heat drops exponentially to zero; see again Figure 6.5. We, 
therefore, conclude that at low enough temperatures the rotational degrees of freedom of 
the molecules are also “frozen.” 

At this stage it appears worthwhile to remark that, since the internal motions of the 
molecules do not make any contribution toward the pressure of the gas (Aj nt being inde¬ 
pendent of V), the quantity (Cp - Cv) is the same for a diatomic gas as for a monatomic 
one. Moreover, under the assumptions made in the very beginning of this section, the 
value of this quantity at all temperatures of interest would be equal to the classical value 
Nk. Thus, at sufficiently low temperatures (when rotational as well as vibrational degrees 
of freedom of the molecules are “frozen”), we have, by virtue of the translational motion 
alone, 


C v =\n1c, C p = 5 -NK ; y=jj. 


(24) 


As temperature rises, the rotational degrees of freedom begin to “loosen up” until we 
reach temperatures that are much larger than © r but much smaller than ©„; the rotational 
degrees of freedom are then fully excited while the vibrational ones are still “frozen.” 
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T (in K)-► 


FIGURE 6.6 The rotational-vibrational specific heat, Cp, of the diatomic gases HD, HT, and DT. 


Accordingly, for © r < T ® v , 

Cv = -Nk, Cp = - Nk; y = (25) 

z Zb 

As temperature rises further, the vibrational degrees of freedom as well start loosening up, 
until we reach temperatures that are much larger than ® v . Then, the vibrational degrees of 
freedom are also fully excited and we have 

C v = 7 -Nk, C P =^Nk; y= % -. (26) 

These features are displayed in Figure 6.6 where the experimental results for Cp are plot¬ 
ted for three gases HD, HT, and DT. We note that, in view of the considerable difference 
between the values of 0,- and ® v , the situation depicted by (25) prevails over a consider¬ 
ably large range of temperatures. In passing, it may be pointed out that, for most diatomic 
gases, the situation at room temperatures corresponds to the one depicted by (25) . 

We now study the case of homonuclear molecules, such as AA. To start with, we consider 
the limiting case of high temperatures where classical approximation is admissible. The 
rotational motion of the molecule may then be visualized as a rotation of the molecular 
axis, that is, the line joining the two nuclei, about an “axis of rotation’’ that is perpendic¬ 
ular to the molecular axis and passes through the center of mass of the molecule. Then, 
the two opposing positions of the molecular axis, namely the ones corresponding to the 
azimuthal angles 0 and cp + n, differ simply by an interchange of the two identical nuclei 
and, hence, correspond to only one distinct state of the molecule. Therefore, in the evalu¬ 
ation of the partition function, the range of the angle </> should be taken as (0, n) instead of 
the customary (0, 2n). Moreover, since the energy of rotational motion does not depend on 
angle 0, the only effect of this on the partition function of the molecule would be to reduce 
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it by a factor of 2. We thus obtain, in the classical approximation , n 

./nuc-rot(T) = (2Sa +1) 2 ■ (27) 

Zty/- 

Obviously, the factor 2 here will not affect the specific heat of the gas; in the classical 
approximation, therefore, the specific heat of a gas of homonuclear molecules is the same 
as that of a corresponding gas of heteronuclear molecules. 

In contrast, significant changes result at relatively lower temperatures where the states 
of rotational motion have to be treated as discrete. These changes arise from the cou¬ 
pling between the nuclear and the rotational states that in turn arises from the symmetry 
character of the nuclear-rotational wavefunction. As discussed in Section 5.4, the total 
wavefunction of a physical state must be either symmetric or antisymmetric (depend¬ 
ing on the statistics obeyed by the particles involved) with respect to an interchange of 
two identical particles. Now, the rotational wavefunction of a diatomic molecule is sym¬ 
metric or antisymmetric depending on whether the quantum number l is even or odd. 
The nuclear wavefunction, on the other hand, consists of a linear combination of the spin 
functions of the two nuclei and its symmetry character depends on the manner in which 
the combination is formed. It is not difficult to see that, of the (2 Sa + l) 2 different com¬ 
binations that one constructs, exactly (Sa + 1)(2 Sa + 1) are symmetric with respect to an 
interchange of the nuclei and the remaining Sa(2Sa + 1) antisymmetric. 12 In constructing 
the total wavefunction, as a product of the nuclear and the rotational wavefunctions, we 
then proceed as follows: 

(i) If the nuclei are fermions (Sa — ■ • ■), as in the molecule H 2 , the total wavefunction 

must be antisymmetric. To secure this, we may associate any one of the Sa(2Sa + 1) 
antisymmetric nuclear wavefunctions with any one of the even-Z rotational 
wavefunctions or any one of the (Sa + 1)(2Sa + 1) symmetric nuclear wavefunctions 
with any one of the odd-Z rotational wavefunctions. Accordingly, the nuclear- 
rotational partition function of such a molecule would be 

fnuc-'rod 7 ’) = Sa(2Sa + l)r ev en + (S A + 1)(2$a + l)r odd , (28) 


11 It seems instructive to outline here the purely classical derivation of the rotational partition function. Specifying 
the rotation of the molecule by the angles W,<l>) and the corresponding momenta ( pg,p'$), the kinetic energy assumes the 
form 


from which 


= tip! + ^r e p 2 r 


jrat(T)= A/e e ‘° t / kT (dp e dp4 > ddd(l»= / d<j>. 

o 

For heteronuclear molecules 0 max = 2 tt , while for homonuclear ones 0 max = n. 

12 See, for example, Schiff (1968, Section 41). 
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where 


r even — E (21 + 1) exp{—Z(Z + l)@ r /T} (29) 

1 = 0 , 2 ,... 

and 

OO 

f'odd — E (2/+l)exp{-/(/+l)0 r /r}. (30) 

/= 1,3,... 

(ii) If the nuclei are bosons (Sa = 0,1,2,...), as in the molecule D 2 , the total wavefunction 
must be symmetric. To secure this, we may associate any one of the (Sa + 1)(2Sa + 1) 
symmetric nuclear wavefunctions with any one of the even-Z rotational wavefunc- 
tions or any one of the Sa(2Sa + 1) antisymmetric nuclear wavefunctions with any 
one of the odd-Z rotational wavefunctions. We then have 


Znuc-rot^) = &A + 1)(2S A + l)r ev en + S A (2S A + l)r odd . (31) 

At high temperatures, it is the larger values of Z that contribute most to the sums (29) and 
(30). The difference between the two sums is then negligibly small, and we have 

r even — Puld — = T/ 2@ r ; (32) 

see equations (16) and (17). Consequently, 


Znuc-rot -ZnuEot = (2Sa + l) 2 T/2& r , (33) 

in agreement with our previous result (27) . Under these circumstances, the statistics gov¬ 
erning the nuclei does not make a significant difference to the thermodynamic behaviour 
of the gas. 

Things change when the temperature of the gas is in a range comparable to the value 
of © r . It seems most reasonable then to regard the gas as a mixture of two components, 
generally referred to as ortho- and para-, whose relative concentrations in equilibrium are 
determined by the relative magnitudes of the two parts of the partition function (28) or 
(31), as the case may be. Customarily, the name ortho- is given to that component that 
carries the larger statistical weight. Thus, in the case of fermions (as in H 2 ), the ortho- to 
para-ratio is given by 


(F.D.) _ (Z>A + l)r 0 dd 
3/1 r even 

while in the case of bosons (as in D 2 ), the corresponding ratio is given by 


„(B.E.) 


(Sa + Uneven 


(34) 


3/1 Pidti 


(35) 
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As temperature rises, the factor r 0 ^/r even tends to unity and the ratio n, in each case, 
approaches the temperature-independent value (Ah + 1)/Sa. In the case of H 2 , this lim¬ 
iting value is 3 (since ,Sh = \) while in the case of D 2 it is 2 (since Sa = 1). At sufficiently 
low temperatures, one may retain only the main terms of the sums (29) and (30), with the 
result that 



(36) 


which tends to zero as T ->• 0. The ratio n then tends to zero in the case of fermions and 
to infinity in the case of bosons. Hence, as T -»• 0, the hydrogen gas is wholly para-, while 
deuterium is wholly ortho-; of course, in each case, the molecules do settle down in the 
rotational state 1 = 0. 

At intermediate temperatures, one has to work with the equilibrium ratio (34), or (35), 
and with the composite partition function (28), or (31), in order to compute the thermody¬ 
namic properties of the gas. One finds, however, that the theoretical results so derived do 
not generally agree with the ones obtained experimentally. This discrepancy was resolved 
by Dennison (1927) who pointed out that the samples of hydrogen, or deuterium, ordi¬ 
narily subjected to experiment are not in thermal equilibrium as regards the relative 
magnitudes of the ortho- and para-components. These samples are ordinarily prepared 
and kept at room temperatures that are well above © r , with the result that the ortho- to 
para-ratio in them is very nearly equal to the limiting value (Sa + 1)Sa- 

If now the temperature is lowered, one would expect this ratio to change in accordance 
with equation (34), or (35). However, it does not do so for the following reason. Since 
the transition of a molecule from one form of existence to another involves the flipping 
of the spin of one of its nuclei, the transition probability of the process is quite small. Actu¬ 
ally, the periods involved are of the order of a year! Obviously, one cannot expect to attain 
the true equilibrium ratio n during the short times available. Consequently, even at lower 
temperatures, what one generally has is a nonequilibrium mixture of two independent 
substances, the relative concentration of which is preassigned. The partition functions (28) 
and (31) as such are, therefore, inapplicable; we rather have directly for the specific heat 



(37) 


and 



(38) 


where 


Ceven/odd — Nk j T 2 (3/9 T) In feyejj/odd J 


(39) 


We have, therefore, to compute C eV en and C 0 dd separately and then derive the net value 
of the rotational specific heat with the help of formula (37) or (38), as the case may be. 
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FIGURE 6.7 The theoretical specific heat of a 1:3 mixture of para-hydrogen and ortho-hydrogen. The experimental 
points originate from various sources listed in Wannier (1966). 


Figure 6.7 shows the relevant results for hydrogen. Curves 1 and 2 correspond to the para- 
hydrogen (Ceven) and the ortho-hydrogen (C 0 dd)> respectively, while curve 3 represents the 
weighted mean, as given by equation (37) . The experimental results are also shown in the 
figure; the agreement between theory and experiment is clearly good. 

Further evidence in favor of Dennison’s explanation is obtained by performing exper¬ 
iments with ortho-para mixtures of different relative concentration. This can be done by 
speeding up the ortho-para conversion by passing hydrogen over activated charcoal. By 
doing this at various temperatures, and afterwards removing the catalyst, one can fix the 
ratio n at any desired value. The specific heat then follows a curve obtained by mixing C eV en 
and C 0 dd with appropriate weight factors. Further, if one measures the specific heat of the 
gas in such a way that the ratio n, at every temperature T, has the value that is given by 
formula (34), it indeed follows the curve obtained from expression (28) for the partition 
function. 

6.5.C Polyatomic molecules 

Once again, the translational degrees of freedom of the molecules contribute their usual 
share, | /c per molecule, toward the specific heat of the gas. As regards the lowest electronic 
state, it is, in most cases, far below any of the excited states; nevertheless, it generally pos¬ 
sesses a multiplicity (depending on the orbital and spin angular momenta of the state) 
that can be taken care of by a degeneracy factor g e . As regards the rotational states, they 
can be treated classically because the large values of the moments of inertia characteris¬ 
tic of polyatomic molecules make the quantum of rotational energy, h 2 /2Ij, much smaller 
than the thermal energy kT at practically all temperatures of interest. Consequently, the 
interaction between the rotational states and the states of the nuclei can also be treated 
classically. As a result, the nuclear-rotational partition function is given by the product of 
the respective partition functions, divided by a symmetry number y that denotes the num¬ 
ber of physically indistinguishable configurations realized during one complete rotation of 
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the molecule: 13 


7 nuc—rot (T) 


gnucjmt(T) _ 

y 


(40) 


compare to equation (27). Her e,j^ 0 t (T) is the rotational partition function of the molecule 
evaluated in the classical approximation (without paying regard to the presence of identi¬ 
cal nuclei, if any); it is given by 


7rot( T ) = 71 


1/2 


/2/i fcr\ 1/2 

{~hT ) 


( 2i 2 kT y 2 


^ 2 hkr y 2 


(41) 


where I\, h, and I 3 are the principal moments of inertia of the molecule; see Prob¬ 
lem 6.27. 14 The rotational specific heat is then given by 


Got = Nk A j T 2 A lnjg t (T) | = | Nk , 


(42) 


consistent with the equipartition theorem. 

As regards vibrational states, we first note that, unlike a diatomic molecule, a poly¬ 
atomic molecule has not one but several vibrational degrees of freedom. In particular, a 
noncollinear molecule consisting of n atoms has 3n — 6 vibrational degrees of freedom, six 
degrees of freedom out of the total 3 n having gone into the translational and rotational 
motions. On the other hand, a collinear molecule consisting of n atoms would have 3n — 5 
vibrational degrees of freedom, for the rotational motion in this case has only two, not 
three, degrees of freedom. The vibrational degrees of freedom correspond to a set of nor¬ 
mal modes characterized by a set of frequencies It might happen that some of these 
frequencies have identical values; we then speak of degenerate frequencies . 15 

In the harmonic approximation, these normal modes may be treated independently 
of one another. The vibrational partition function of the molecule is then given by the 
product of the partition functions corresponding to individual normal modes, that is, 


jv ib(T) — J~[ 
i 


e -@i/2T 



(43) 


13 For example, the symmetry number y for H 2 O (isosceles triangle) is 2, for NH 3 (regular triangular pyramid) it is 3, 
while for CH 4 (tetrahedron) and CeHg (regular hexagon) it is 12. For heteronuclear molecules, the symmetry number is 
unity. 

14 In the case of a collinear molecule, such as N 2 O or CO 2 , there are only two degrees of freedom for rotation; con¬ 
sequently, jf ot (T) is given by (2IkT /ft 2 ), where I is the (common) value of the two moments of inertia of the molecule; 
see equation (17). Of course, we must also take into account the symmetry number y. In the examples quoted here, 
the molecule N 2 O, being spatially asymmetric (NNO), has symmetry number 1, while the molecule CO 2 , being spatially 
symmetric (OCO), has symmetry number 2. 

15 For example, of the four frequencies characterizing the normal modes of vibration of the collinear molecule OCO, 

t 

two that correspond to the (transverse) bending modes, namely o C O’ are equal while the others that correspond to 
(longitudinal) oscillations along the molecular axis, namely ^O ^O and ^O C O^, are different; see Problem 6.28. 
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and the vibrational specific heat is given by the sum of the contributions arising from the 
individual modes: 


Cvib=M-]T 




P ®i/T 


(e*i/ T -iy 


(44) 


In general, the various (-), are of order 10 3 K; for instance, in the case of CO 2 , which was 
cited in footnote 15, ©1 = © 2 = 960 K, ©3 = 1,990 K, and ©4 = 3,510 K. For temperatures 
large in comparison with all ©,-, the specific heat would be given by the equipartition value, 
namely Nk for each of the normal modes. In practice, however, this limit can hardly be 
realized because the polyatomic molecules generally break up well before such high tem¬ 
peratures are reached. Secondly, the different frequencies op of a polyatomic molecule 
are generally spread over a rather wide range of values. Consequently, as temperature 
rises, different modes of vibration get gradually “included” into the process; in between 
these “inclusions,” the specific heat of the gas may stay constant over considerably large 
stretches of temperature. 


6.6 Chemical equilibrium 

The equilibrium amounts of chemicals in a chemical reaction are determined by the 
chemical potentials of each of the species. Consider the following chemical reaction 
between chemical species 71 and B to form species X and Y with stoichiometric coefficients 
va, vb, v x , and v Y : 


vaA+v b B<=±vxX+vyY. ( 1 ) 

Each individual reaction that occurs changes the number of molecules of each species 
according to the stoichiometric coefficients. If the initial numbers of molecules of the 
species are N®, /Vjj, N®, and Ny, then the numbers of each species after AN chemical reac¬ 
tions have occurred would be Na = Ny — v^AN, Ng = /Vjj — v B AN, N x — N® + v x AN, and 
N y = N° + v y AN. If AN > 0 , the reaction has proceeded in the positive direction increas¬ 
ing the numbers of X and Y. If AN < 0, the reaction has proceeded in the direction of 
increasing the numbers of A and B. If the reaction takes place in a closed isothermal sys¬ 
tem with fixed pressure, the Gibbs free energy G(Na,Nb,N x ,N y ,P,T ) is changed by the 
amount 


AG = (—VAfiA — I’BMB + vxf-X + vypy)AN, (2) 

where pa — (• mx) Tp i s chemical potential of species A, and so on; see Sections 3.3, 

4.7, and Appendix H. Since the Gibbs free energy decreases as a system approaches equi¬ 
librium, AG< 0. When the system reaches chemical equilibrium, the Gibbs free energy 
reaches its minimum value so AG = 0. This gives us the general relationship for chemical 
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equilibrium of the reaction in equation (1), namely 


v AMj4 + VBfJ-B = VxlJ-X + VyfJ-Y- (3) 

Note that if a chemical species that acts as a catalyst is added in equal amounts to 
both sides of equation (1), the equilibrium relation (3) is unaffected. Therefore, a cata¬ 
lyst may serve to increase the rate of approach toward equilibrium, without affecting the 
equilibrium condition itself. 

If the free energy can be approximated as a sum of the free energies of the individual 
species such as in an ideal gas or a dilute solution, then we can derive a simple rela¬ 
tion between the equilibrium densities of the species. Following from equations (3.5.10) 
and (6.5.4), the Helmholtz free energy of a classical ideal gas consisting of molecules with 
internal degrees of freedom can be written as 

/ N)J* \ 

A(N, V, T) = Ne + NkTln I — 1 - NkT - NkT In j(T), (4) 


where e is the ground state energy of the molecule, X = h/y/2nmkT is the thermal 
deBroglie wavelength, and j(T) is the partition function for the internal degrees of freedom 
of the molecule. This gives for the chemical potential of species A 


BA = 


= ( d k\ = 


e A + kTln(n A X^\ -kTlnj A (T), 


\ SN A J j y 

where n A is the number density of species A. The equilibrium condition then (3) gives 

[X] VX [Y] VY 
[A] v a[B]"b 

where [A] = n A /no, and so on, 


(5) 


: K(T) — exp /SA/x (0) ^ , 


( 6 ) 


A ( 0 ) ( 0 ) , ( 0 ) ( 0 ) (0) , 

Am ’ = i’xB x + v yb y ~ v aB a ~vb^ b , (7a) 

= s A + kTIn (noX^j - kT\nj A (T), etc. (7b) 

and K(T) is the equilibrium constant. 

Equation (6) is called the law of mass action. The quantity no is a standard number 
density and b a \ and so on, are the chemical potentials of the species at temperature T 
and standard number density hq. The quantity A/i a]> represents the Gibbs free energy 
change per chemical reaction at standard density. Note that the reaction constant K(T) 
is a function only of the temperature and determines the densities of the components in 
equilibrium at temperature T through equation (6). The standard number density for gases 
is usually chosen to be the number density of an ideal gas at temperature T and standard 
pressure, that is, no — (latm )/kT. The standard density for aqueous solutions is usually 
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chosen to be one mole per liter. The Gibbs free energy in chemical tables is expressed 
relative to the standard states of the elements. 

We now examine a specific example, the combustion of hydrocarbons with oxygen in 
an internal combustion engine. The reaction used to power clean buses and automobiles 
using natural gas is 


CH 4 + 2 O 2 CO 2 + 2 H 2 O . ( 8 ) 

The primary reaction products are carbon dioxide and water vapor but carbon monoxide 
is also produced by the reaction 


2 CH 4 + 3 O 2 ^2C0 + 4H 2 0. (9) 

A primary goal for a clean burning engine is to combust nearly all the hydrocarbon fuel 
while producing as little carbon monoxide as possible. By combining reactions (8) and (9), 
we get a direct reaction between carbon monoxide, oxygen, and carbon dioxide: 

2CO + 0 2 2C0 2 . (10) 

Equation (6) now gives the equilibrium ratio of CO to CO 2 as 


[CO] 

[C0 2 ] 

At T « 1,500K, as combustion occurs inside the cylinder of the engine, the equilibrium 
constant K ~ 10 10 so carbon monoxide is present as a combustion product in the few 
parts per million range — combustion reactions that are not in equilibrium can have CO 
concentrations well above the equilibrium value. The exhaust gases cool quickly during 
the power stroke of the engine. As these gases exit the exhaust at T ^ 600 K, the equilib¬ 
rium constant K ~ 10 40 , which should result in almost no carbon monoxide in the exhaust 
stream. However, the reaction rate is typically too slow to keep the CO concentration in 
chemical equilibrium during the rapid cooling, so the amount of CO present in the exhaust 
stream remains close to the larger value determined at the higher temperature. 16 Fortu¬ 
nately, the leftover carbon monoxide can be converted into carbon dioxide at the exhaust 
temperature in a catalytic converter that uses platinum and palladium as catalysts to 
increase the reaction rate. Equation (11) indicates that the carbon monoxide fraction is 
reduced by increasing the amount of oxygen present in the reaction. This is accomplished 
by running the engine with a hydrocarbon/air ratio that is a little bit short of the stoichio¬ 
metric point of equation (8) . This reduces the amount of CO left from the combustion itself 
and also leaves excess O 2 in the exhaust stream for use in the catalytic converter. 

16 Very similar effects happened during the early stages of the universe as the temperature cooled but the cooling rate 
was too rapid for some constituents to remain in thermal equilibrium; see Chapter 9. 


1 


^(T)[0 2 ] 


( 11 ) 
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Problems 

6.1. Show that the entropy of an ideal gas in thermal equilibrium is given by the formula 

S = k^2 [( n e + 1) in (n E + 1) - (n E ) ln{n £ )] 

£ 

in the case of bosons and by the formula 

S=Jfc£Hl- n E ) ln(l - n e ) - {n e )\n{n E )\ 


in the case of fermions. Verify that these results are consistent with the general formula 


J2p^n)lnp e (n) \, 


s=-fcE 


where p E (n) is the probability that there are exactly n particles in the energy state s. 

6.2. Derive, for all three statistics, the relevant expressions for the quantity («j?) — (n E ) 2 from the 
respective probabilities p E (n). Show that, quite generally, 


(■ n 2 E ) - (n E ) 2 = kT 


a {n E y 
ap / t 


compare with the corresponding result, (4.5.3), for a system embedded in a grand canonical 
ensemble. 

6.3. Refer to Section 6.2 and show that, if the occupation number n E of an energy level e is restricted 
to the values 0,1,...,!, then the mean occupation number of that level is given by 


{ n E ) = 


1 


/+ 1 


z~ 1 e^ E — 1 (z~ 1 eP e ) l+l — 1 


Check that while / = 1 leads to (n E )f.d.> I -*■ °° leads to (n E ) b.e.- 
6.4. The potential energy of a system of charged particles, characterized by particle charge e and 
number density n(r), is given by 


U = 


II 


n(r)n(r') 

\r-r’\ 


drdr' 


'*‘1 


n(r)<p ext (r)dr, 


where <p ex t(r) is the potential of an external electric field. Assume that the entropy of the system, 
apart from an additive constant, is given by the formula 


S=-k I n(r)]nn(r)dr; 

compare to formula (3.3.13). Using these expressions, derive the equilibrium equations satisfied 
by the number density n(r) and the total potential cj> (r), the latter being 


<£ext (r) + e 


/ 


n(r’) 

\r-r>Y 


dr'. 


6.5. Show that the root-mean-square deviation in the molecular energy e, in a system obeying 
Maxwell-Boltzmann distribution, is ^(2/3) times the mean molecular energy e. Compare this 
result with that of Problem 3.18. 

6.6. Show that, for any law of distribution of molecular speeds, 


<“>(- 


> 1 . 


Check that the value of this quantity for the Maxwellian distribution is 4 /jt. 
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6.7. Through a small window in a furnace, which contains a gas at a high temperature T, the spectral 
lines emitted by the gas molecules are observed. Because of molecular motions, each spectral line 
exhibits Doppler broadening. Show that the variation of the relative intensity /(A) with wavelength 
A. in a line is given by 


/(A) a exp 


me 2 (A — A 0 ) 2 
2A JicT J 


where m is the molecular mass, c the speed of light, and Ao the mean wavelength of the line. 

6.8. An ideal classical gas composed of N particles, each of mass in, is enclosed in a vertical cylinder of 
height L placed in a uniform gravitational field (of acceleration g) and is in thermal equilibrium; 
ultimately, both N and L ->■ oo. Evaluate the partition function of the gas and derive expressions for 
its major thermodynamic properties. Explain why the specific heat of this system is larger than that 
of a corresponding system in free space. 

6.9. Centrifuge-based uranium enrichment: Natural uranium is composed of two isotopes: 238 U and 
235 U, with percentages of 99.27% and 0.72%, respectively. If uranium hexafluoride gas UF6 is 
injected into a rapidly spinning hollow metal cylinder with inner radius R, the equilibrium pressure 
of the gas is largest at the inner radius and isotopic concentration differences between the axis and 
the inner radius allow enrichment of the concentration of 235 U. 

(a) Write down the Lagrangian <£({<7^ Rk)) for particles of mass m moving in a cylindrical 
coordinate system rotating at angular velocity w and use a Legendre transformation 

&({<1k>Pk}) = '22Pkqk-'£f 

k 


to show that the one-particle ffamiltonian ,H in that cylindrical coordinate system is 


9t{r,0,z,p r ,pe,Pz) 


2 m 


(Pg - mr 2 w) 2 

2 mr 2 2m 


Ignore the internal degrees of freedom of the molecules since they will not affect the density as 
a function of position. Show that the one-particle partition function shown here can be written 
as 


Qi(V,T) = 


OO OO OO K Z.TZ ti 

p J dp r J dp g J dp z J dr J dd J 


dzexp(-fi3 {), 


by constructing the Jacobian of transformation between the cartesian and the cylindrical 
coordinates for the phase space integral. Evaluate the partition function Qi in a closed form 
and determine the Elelmholtz free energy of this system. 

(b) Determine the number density n(r) as a function of the distance r from the axis for the N 
molecules of gas in the rotating cylinder. Show that, in the limit a> -» 0, the density becomes 
uniform with the value n = N/nR 2 H. Find an expression for the ratio of the pressure at the 
inner radius of the cylinder R to the pressure at the axis of the cylinder as a function of w 
and R. 

(c) Evaluate the pressure ratios for the two isotopically different UF6 gases at room temperature 
for the case wR = 500 m/s. Show that the pressure ratio for 238 U is approximately 20% larger 
than the pressure ratio for 235 U so that extracting gas near the axis results in an enriched 
concentration of 235 U. A series of centrifuges can be used to raise the concentration of 235 U 
to create a fissionable grade of uranium for use in power-generating reactors or in nuclear 
weapons. Not surprisingly, this technology is a major concern for possible nuclear 
proliferation. 
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6.10. (a) Show that, if the temperature is uniform, the pressure of a classical gas in a uniform 

gravitational field decreases with height according to the barometric formula 

P(z) = P( 0) exp {— mgz/kT], 

where the various symbols have their usual meanings. 17 
(b) Derive the corresponding formula for an adiabatic atmosphere, that is, the one in which 
(. PV r ), rather than (PV), stays constant. Also study the variation, with height, of the 
temperature T and the density n in such an atmosphere. 

6.11. (a) Show that the momentum distribution of particles in a relativistic Boltzmannian gas, with 

e = clp 2 + m^c 2 ) 1 / 2 , is given by 


fip)dp=Ce ^ c( p 2 +»!o' r ) 1/2 p 2 dp, 


with the normalization constant 


C = 


m 2 cK 2 (pm 0 c 2 )’ 


K v (z) being a modified Bessel function. 

(b) Check that in the nonrelativistic limit ( kT <g: moc 2 ) we recover the Maxwellian distribution, 


f(p)dp = 




while in the extreme relativistic limit (kT m 0 c 2 ) we obtain 


(c) Verify that, quite generally, 


f(p)dp= ^p—e ppc (4np 2 dp). 

07 T 


(pu) = 3kT. 


6.12. (a) Considering the loss of translational energy suffered by the molecules of a gas on reflection 

from a receding wall, derive, for a quasistatic adiabatic expansion of an ideal nonrelativistic 
gas, the well-known relation 

PV Y = const., 

where y = (3a + 2)/3 a, a being the ratio of the total energy to the translational energy of 
the gas. 

Show that, in the case of an extreme relativistic gas, y = (3a + 1) /3a. 

6.13. (a) Determine the number of impacts made by gas molecules on a unit area of the wall in a unit 
time for which the angle of incidence lies between 9 and 9 + d9. 

Determine the number of impacts made by gas molecules on a unit area of the wall in a unit 
time for which the speed of the molecules lies between u and u + du. 

A molecule AB dissociates if it hits the surface of a solid catalyst with a normal translational 
energy greater than 10 19 J. Show that the rate of the dissociative reaction AB -> A + B is more 
than doubled by raising the temperature of the gas from 300 K to 310 K. 

6.14. Consider the effusion of molecules of a Maxwellian gas through an opening of area a in the walls of 
a vessel of volume V. 

(a) Show that, while the molecules inside the vessel have a mean kinetic energy | kT, the effused 
ones have a mean kinetic energy 2 kT, T being the quasistatic equilibrium temperature of 
the gas. 


(b) 

(a) 

(b) 

(c) 


7 This formula was first given by Boltzmann (1879). For a critical study of its derivation, see Walton (1969). 
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(b) Assuming that the effusion is so slow that the gas inside is always in a state of quasistatic 

equilibrium, determine the manner in which the density, the temperature, and the pressure of 
the gas vary with time. 

6.15. A polyethylene balloon at an altitude of 30,000 m is filled with helium gas at a pressure of 10 ' 2 atm 
and a temperature of 300 K. The balloon has a diameter of 10 m, and has numerous pinholes of 
diameter 10 5 m each. How many pinholes per square meter of the surface of the balloon must 
there be if 1 percent of the gas were to leak out in 1 hour? 

6.16. Consider two Boltzmannian gases A and B, at pressures Pa and Pb and temperatures 7 '4 and T B} 
respectively, contained in two regions of space that communicate through a very narrow opening 
in the partitioning wall; see Figure 6 . 8 . Show that the dynamic equilibrium resulting from the 
mutual effusion of the two kinds of molecules satisfies the condition 

Pa/Pb = {m A T A /m B T B ) l/2 , 

rather than P A = P B [which would be the case if the equilibrium had resulted from a hydrodynamic 
flow). 



A 

B 



(Pa, T a ) 

(Pb, T b ) 



FIGURE 6.8 The molecules of the gases A and B undergoing a two-way effusion. 


6.17. A small sphere, with initial temperature T, is immersed in an ideal Boltzmannian gas at 
temperature To. Assuming that the molecules incident on the sphere are first absorbed and then 
reemitted with the temperature of the sphere, determine the variation of the temperature of the 
sphere with time. 

[Note: The radius of the sphere may be assumed to be much smaller than the mean free path of the 
molecules.] 

6.18. Show that the mean value of the relative speed of two molecules in a Maxwellian gas is ^2 times 
the mean speed of a molecule with respect to the walls of the container. 

[Note: A similar result for the root-mean-square speeds (instead of the mean speeds) holds under 
much more general conditions.] 

6.19. What is the probability that two molecules picked at random from a Maxwellian gas will have a 
total energy between E and E + dEl Verify that (E) = 3 kT. 

6.20. The energy difference between the lowest electronic state 1 So and the first excited state 3 Si of the 
helium atom is 159,843 cm' 1 . Evaluate the relative fraction of the excited atoms in a sample of 
helium gas at a temperature of 6000 K. 

6.21. Derive an expression for the equilibrium constant K(T) for the reaction H 2 + D 2 -o- 2HD at 
temperatures high enough to allow classical approximation for the rotational motion of the 
molecules. Show that K(o o) = 4. 

6.22. With the help of the Euler-Maclaurin formula (6.5.19), derive high-temperature expansions for 
Teven and r Q( j(j, as defined by equations (6.5.29) and (6.5.30), and obtain corresponding expansions 
for Ceven and C' 0( ld > as defined by equation (6.5.39). Compare the mathematical trend of these 
results with the nature of the corresponding curves in Figure 6.7. Also study the low-temperature 
behavior of the two specific heats and once again compare your results with the relevant parts of 
the aforementioned curves. 

6.23. The potential energy between the atoms of a hydrogen molecule is given by the (semiempirical) 
Morse potential 


V(r) = V 0 {e~ 2(r ~ ro)la - 2e~ {r ~ ro)/a } 
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where Vo = 7 x 10" 12 erg, tq = 8 x 10" 9 cm, and a = 5 x 10" 9 cm. Evaluate the rotational and 
vibrational quanta of energy, and estimate the temperatures at which the rotational and vibrational 
modes of the molecules would begin to contribute toward the specific heat of the hydrogen gas. 

6.24. Show that the fractional change in the equilibrium value of the internuclear distance of a diatomic 
molecule, as a result of rotation, is given by 



here, a> is the angular frequency of the vibrational state in which the molecule happens to be. 
Estimate the numerical value of this fraction in a typical case. 

6.25. The ground state of an oxygen atom is a triplet, with the following/; ne structure: 


ej =2 = e/=i — 158.5 cm 1 = e/ = o — 226.5 cm l . 


Calculate the relative fractions of the atoms occupying different /-levels in a sample of atomic 
oxygen at 300 K. 

6.26. Calculate the contribution of the first excited electronic state, namely 1 A with g e = 2, of the O 2 
molecule toward the Helmholtz free energy and the specific heat of oxygen gas at a temperature 
of 5000 K; the separation of this state from the ground state, namely 3 E with g e = 3, is 7824 cm" 1 . 
How would these results be affected if the parameters & r and ©„ of the O 2 molecule had different 
values in the two electronic states? 

6.27. The rotational kinetic energy of a rotator with three degrees of freedom can be written as 


Mf M 2 M 2 


£ rot — 



where (£, ig f) are coordinates in a rotating frame of reference whose axes coincide with the 
principal axes of the rotator, while ) are the corresponding angular momenta. Carrying 

out integrations in the phase space of the rotator, derive expression (6.5.41) for the partition 
function j m t(T) in the classical approximation. 

6.28. Determine the translational, rotational, and vibrational contributions toward the molar entropy 
and the molar specific heat of carbon dioxide at NTP. Assume the ideal-gas formulae and use 
the following data: molecular weight M = 44.01; moment of inertia / of a CO 2 molecule = 71.67 x 
10" 4 ° gem 2 ; wave numbers of the various modes of vibration: vf = v 2 = 667.3 cm" 1 , v 3 = 

1383.3 cm" 1 , and 1 J 4 = 2439.3 cm" 1 . 

6.29. Determine the molar specific heat of ammonia at a temperature of 300 K. Assume the ideal-gas 
formula and use the following data: the principal moments of inertia: Iy = 4.44 x 10" 4 °gcm 2 , 

I 2 =I 3 = 2.816 x 10" 4 °gcm 2 ; wave numbers of the various modes of vibration: 

v\ = v 2 = 3336 cm" 1 , v 3 = v 4 = 950 cm" 1 , v 5 = 3414 cm" 1 , and v 6 = 1627 cm" 1 . 

6.30. Derive the equilibrium concentration equation ( 6 . 6 . 6 ) from the equilibrium condition (6.6.3). 

6.31. Use the following values to determine the equilibrium constant for the reaction 2CO + 0 2 ^ 2 CO 2 . 

At a combustion temperature of T = 1500 K: Pf-con = “60.95, Pn^g = —35.18, and P^q 2 = —27.08. 
Use this data to compute the fraction [CO]/[C0 2 ] for the case of [0 2 ] = 0.01. Repeat for a catalytic 
converter temperature of T = 600K, where PHqo 2 = “ 103.45, = -45.38, and Pi^q 2 = —23.49. 

6.32. Derive an expression for the equilibrium constant K(T) for the reaction N 2 + O 2 <=* 2NO 
in terms of the ground state energy change Aeq = 2eno — £n 2 — eo 2 and the vibrational and 
rotational partition functions of the diatomic molecules, using results from Section 6.5. Give 
predictions for the ranges of temperatures where the rotational modes are classically excited but 
the vibration modes are suppressed and for higher temperatures where both the rotational and 
vibrational models are classically excited. 

6.33. Analyze the combustion reaction 
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assuming that at combustion temperatures the equilibrium constant K(T) s> 1. Show that 
conducting combustion at the stoichiometric point or just a bit short of the stoichiometric point 
(so there is enough oxygen to oxidize all of the methane) will lead to low amounts of CH 4 in the 
exhaust. Determine the equilibrium amount of CH 4 in terms of the initial excess amount of O 2 . 
Determine the equilibrium constant at T = 1500K from the data Pi^qq 2 = —60.95, /3/r^ = -27.08, 
= - 31 - 95 - and pn ( ^ Q = -44.62. 

6.34. Determine the equilibrium ionization fraction for the reaction 

Na <=* Na + + e~ 

in a sodium vapor. Treat all three species as ideal classical monatomic gases. The ionization energy 
of sodium is 5.139 eVj Na + ions are spin-zero, and neutral Na and free e are both spin-1. Derive 
the Saha equation for the ionized fraction [Na+]/([Na] + [Na + ]) for a neutral plasma as a function 
of temperature at a fixed total density. Plot the ionized fraction as a function of temperature for 
some chosen total density. 

[Note that, this calculation is very similar to the one concerning ionized hydrogen fraction as a 
function of temperature during the recombination era in the early universe; see Section 9.8.] 




Ideal Bose Systems 


In continuation of Sections 6.1 through 6.3, we shall now investigate in detail the physical 
behavior of a class of systems in which, while the intermolecular interactions are still neg¬ 
ligible, the effects of quantum statistics (which arise from the indistinguishability of the 
particles) assume an increasingly important role. This means that the temperature T and 
the particle density n of the system no longer conform to the criterion 


nX 3 = 


nh 3 


(. 2nmkT ) 3 / 2 


« 1 , 


(5.5.20) 


where A.{= h/(2 jt mkT) }! ' 2 \ is the mean thermal wavelength or thermal deBroglie wave¬ 
length of the particles. In fact, the quantity nX 3 turns out to be a very appropriate 
parameter, in terms of which the various physical properties of the system can be ade¬ 
quately expressed. In the limit nX 3 —> 0, all physical properties go over smoothly to their 
classical counterparts. For small, but not negligible, values of nX 3 , the various quantities 
pertaining to the system can be expanded as power series in this parameter; from these 
expansions one obtains the first glimpse of the manner in which departure from classi¬ 
cal behavior sets in. When nX 3 becomes of the order of unity, the behavior of the system 
becomes significantly different from the classical one and is characterized by quantum 
effects. A study of the system under these circumstances brings us face to face with a set of 
phenomena unknown in classical statistics. 

It is evident that a system is more likely to display quantum behavior when it is at a 
relatively low temperature and/or has a relatively high density of particles. 1 Moreover, the 
smaller the particle mass the larger the quantum effects. 

Now, when nX 3 is of the order of unity, then not only does the behavior of a system 
exhibit significant departure from typical classical behavior but it is also influenced by 
whether the particles constituting the system obey Bose-Einstein statistics or Fermi-Dirac 
statistics. Under these circumstances, the properties of the two kinds of systems are them¬ 
selves very different. In the present chapter we consider systems belonging to the first 
category while the succeeding chapter will deal with systems belonging to the second 
category. 


'Actually it is the ratio n/r 3 / 2 , rather than the quantities n and T separately, that determines the degree of degeneracy 
in a given system. For instance, white dwarf stars, even at temperatures of order 10 7 K, constitute statistically degenerate 
systems; see Section 8.5. 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00007-4 
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7.1 Thermodynamic behavior of an ideal Bose gas 

We obtained, in Sections 6.1 and 6.2, the following formulae for an ideal Bose gas: 

PV 

-— = InO = — ^ln(l — ze~P e ) 


and 


N^ns) = J2 


1 

z~ 1 ef is — 1 ’ 


( 1 ) 


( 2 ) 


where p = 1 /kT, while z is the fugacity of the gas which is related to the chemical potential 
H through the formula 


z = exp (fi/kT); (3) 

as noted earlier, ze~^ E , for all e, is less than unity. In view of the fact that, for large V, the 
spectrum of the single-particle states is almost a continuous one, the summations on the 
right sides of equations (1) and (2) may be replaced by integrations. In doing so, we make 
use of the asymptotic expression (2.4.7) for the nonrelativistic density of states a(e) in the 
neighborhood of a given energy e, namely 2 

a(s)de = (2nV /h 3 ){2m) 3 ^ 2 e 1 ^ 2 ds. (4) 

We, however, note that by substituting this expression into our integrals we are inadver¬ 
tently giving a weight zero to the energy level e = 0. This is wrong because in a quantum- 
mechanical treatment we must give a statistical weight unity to each nondegenerate 
single-particle state in the system. It is, therefore, advisable to take this particular state 
out of the sum in question before carrying out the integration; for a rigorous justification 
of this (unusual) step, see Appendix F. We thus obtain 

OO 

^ = -^(2 m ) 3/2 J e 1 / 2 ln(l — ze~P E )de — T ln(l — z) (5) 

o 


and 


OO 

N 2n o/o f e l / 2 ds 1 z 

r = F (2m) I + 

0 


( 6 ) 


of course, the lower limit of these integrals can still be taken as 0 , because the state e = 0 is 
not going to contribute toward them anyway. 

Before proceeding further, a word about the relative importance of the last terms in 
equations (5) and ( 6 ). For z« 1, which corresponds to situations not far removed from 


2 The theory of this section is restricted to a system of nonrelativistic particles. For the more general case, see Kothari 
and Singh (1941) and Landsberg and Dunning-Davies (1965). 
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the classical limit, each of these terms is of order 1 /N and, therefore, negligible. How¬ 
ever, as z increases and assumes values close to unity, the term z/( 1 — z)V in (6), which 
is identically equal to Nq/V (No being the number of particles in the ground state e = 0), 
can well become a significant fraction of the quantity N/V; this accumulation of a macro¬ 
scopic fraction of the particles into a single state s = 0 leads to the phenomenon of 
Bose-Einstein condensation. Nevertheless, since z/{ 1 — z) — No and hence z = No/(No + 1), 
the term {—V~ l ln(l — z)} in (5) is equal to {V~ l ln(lVo + 1)}, which is at most 0(N~ y lnN); 
this term is, therefore, negligible for all values of z and hence may be dropped altogether. 

We now obtain from equations (5) and ( 6 ), on substituting fie = x, 

P_ = _ 2n(2mkT ) — j x i/2 ln{1 _ ze -x^ dx = J_ g5/2( 2 ) ( 7 ) 

0 


and 


oo 

N — No 2n(2mkT ) 3 / 2 f x l ! 2 dx 1 

~ V~ = P J z~ x e x - 1 = P g3/2(z) ’ 

o 


( 8 ) 


where 


X = h/(2nmkT) 1 l 2 , 

while g v (z) are Bose-Einstein functions defined by, see Appendix D, 

00 i o a 

1 f x l dx z 2 z 3 

z~ l e x - 1 ~~ + 2y + T J + '"’ 


(9) 


( 10 ) 


0 


note that to write (7) in terms of the function gb/i{z) we first carried out an integration by 
parts. Equations (7) and ( 8 ) are our basic results; on elimination of z, they would give us 
the equation of state of the system. 

The internal energy of this system is given by 


U = - 


d 

w 


in<aj =fcr 2 j 


d tPV 
dT \ kT 


z,V 


- k T 2v ^MMrd\- 3 2 kT i^r. 


(ID 


here, use has been made of equation (7) and of the fact that X ex T b 2 . Thus, quite 
generally, our system satisfies the relationship 

P=\(U/\ 0. (12) 

For small values of z, we can make use of expansion (10); at the same time, we can neglect 
N 0 in comparison with N. An elimination of z between equations (7) and ( 8 ) can then be 
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carried out by first inverting the series in (8) to obtain an expansion for z in powers of n)? 
and then substituting this expansion into the series appearing in (7). The equation of state 
thereby takes the form of the virial expansion, 


PV 

NkT 




(13) 


where v(= 1 /n) is the volume per particle; the coefficients a/, which are referred to as the 
virial coefficients of the system, turn out to be 


a.\ = 1, 

1 

a 2 = -- = -0.17678, 

4^2 

a 3 =-( —- - 1 | = -0.00330, 

\9\/3 8/ 

/ 3 5 1 \ 

a 4 = - +-=-- = -0.00011, 

\32 32V2 2^6/ 

and so on. For the specific heat of the gas, we obtain 


Cy 

Nk 




3U\ 

)N,V 

5-3 1 
— 7 .— a l\ 



3 

3T 
l-i 



V 


1 + 0.0884| p j + 0.0066( p | + 0.0004 



(14) 


(15) 


As T -»• 00 (and hence X -»• 0), both the pressure and the specific heat of the gas approach 
their classical values, namely nkT and | Nk, respectively. We also note that at finite, but 
large, temperatures the specific heat of the gas is larger than its limiting value; in other 
words, the (Cy, 7>curve has a negative slope at high temperatures. On the other hand, as 
T 0, the specific heat must go to zero. Consequently, it must pass through a maximum 
somewhere. As seen later, this maximum is in the nature of a cusp that appears at a criti¬ 
cal temperature T c ; the derivative of the specific heat is found to be discontinuous at this 
temperature (see Figure 7.4 later in this section). 

As the temperature of the system falls (and the value of the parameter '/■? /v grows), 
expansions such as (13) and (15) do not remain useful. We then have to work with formulae 
(7), (8), and (11) as such. The precise value of z is now obtained from equation (8), which 
may be rewritten as 


(. 2 nmkT) 3 / 2 

N e = V -p- g 3 / 2 (z), 


( 16 ) 
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where N e is the number of particles in the excited states (e / 0); of course, unless z gets 
extremely close to unity, JV e ~lV. 3 It is obvious that, for 0 < z < I, the function gs/ 2 (z) 
increases monotonically with z and is bounded, its largest value being 

g3/2(l) = 1 + ^ + i + "- = ?Q)^ 2 - 612 ’ W 

see equation (D.5) in Appendix D. Hence, for all z of interest, 

g»/2(s)<f(!j. (18) 


Consequently, for given V and T, the total (equilibrium) number of particles in all the 
excited states taken together is also bounded, that is, 


h 6 


(19) 


Now, so long as the actual number of particles in the system is less than this limiting value, 
everything is well and good; practically all the particles in the system are distributed over 
the excited states and the precise value of z is determined by equation (16), with N e ~ N. 
However, if the actual number of particles exceeds this limiting value, then it is natural that 
the excited states will receive as many of them as they can hold, namely 




( 20 ) 


while the rest will be pushed en masse into the ground state e = 0 (whose capacity, under 
all circumstances, is essentially unlimited): 


Nq = N — 


^*^( 3)1 


( 21 ) 


The precise value of z is now determined by the formula 


Np 

Nq + 1 



( 22 ) 


which, for all practical purposes, is unity. This curious phenomenon of a macroscopi- 
cally large number of particles accumulating in a single quantum state (e = 0) is generally 
referred to as the phenomenon of Bose-Einstein condensation. In a certain sense, this 
phenomenon is akin to the familiar process of a vapor condensing into the liquid state, 
which takes place in the ordinary physical space. Conceptually, however, the two pro¬ 
cesses are very different. Firstly, the phenomenon of Bose-Einstein condensation is purely 


3 Remember that the largest value z can have in principle is unity. In fact, as T —>■ 0, z — Nq/(Nq + 1) —► N/(N + 1), 
which is very nearly unity (but certainly on the right side of it). 
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of quantum origin (occurring even in the absence of intermolecular forces); secondly, it 
takes place at best in the momentum space and not in the coordinate space . 4 
The condition for the onset of Bose-Einstein condensation is 




or, if we hold N and V constant and vary T, 


T <T C = 


h 2 

2nmk 



(23) 


(24) 5 


here, T c denotes a characteristic temperature that depends on the particle mass m and 
the particle density N/V in the system. Accordingly, for T < T c , the system may be looked 
on as a mixture of two “phases”: 

(i) a normal phase, consisting of N e {— N( T/ T c ) 3 l 2 } particles distributed over the excited 
states (e ^ 0 ), and 

(ii) a condensed phase, consisting of N 0 {= (N — N e )} particles accumulated in the ground 
state (e = 0 ). 

Figure 7.1 shows the manner in which the complementary fractions ( N e /N ) and (N 0 /N) 
vary with T. For T > T c , we have the normal phase alone; the number of particles in the 
ground state, namely z/( 1 — z), is 0(1), which is completely negligible in comparison with 
the total number N. Clearly, the situation is singular at T = T c . For later reference, we note 
that, at T —>■ T c from below, the condensate fraction vanishes as follows: 


Np / T\ 3/2 _ 3 T C -T 

N \Tc) ~ 2 T c 


(25) 


A knowledge of the variation of z with T is also of interest here. It is, however, sim¬ 
pler to consider the variation of z with (v/X. 3 ), the latter being proportional to T 3/2 . 
For 0 < ( v/X 3 ) < (2.612) -1 , which corresponds to 0 < T < T c , the parameter z~ 1; see 
equation (22). For (v/X 3 ) > (2.612) -1 , z < 1 and is determined by the relationship 

g?,/ 2 (z) = ( X 3 /v ) < 2.612; (26) 6 

4 0f course, the repercussions of this phenomenon in the coordinate space are no less curious. It prepares the stage 
for the onset of superfluidity, a quantum manifestation discussed in Section 7.6. 

5 For a rigorous discussion of the onset of Bose-Einstein condensation, see Landsberg (1954b), where an attempt 
has also been made to coordinate much of the previously published work on this topic. For a more recent study, see 
Greenspoon and Pathria (1974), Pathria (1983), and Appendix F. 

6 An equivalent relationship is g3/2(z)/g3/2(l) = (T c /T) 3 t 2 < 1. 
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FIGURE 7.1 Fractions of the normal phase and the condensed phase in an ideal Bose gas as a function of the 
temperature parameter (T/T c ). 



(WA 3 )-► 


FIGURE 7.2 The fugacity of an ideal Bose gas as a function of O/A 3 ). 


see equation (8). For (v/k 3 )» 1, we have g 3 / 2 fz)<sC 1 and, hence, z<<C 1. Under these 
circumstances, g 3 / 2 (z)~z; see equation (10). Therefore, in this region, z ~ (y/A. 3 ) -1 , in 
agreement with the classical case. 7 Figure 7.2 shows the variation of z with ( v/X 3 ). 

Next, we examine the ( P , T )-diagram of this system, that is, the variation of P with T, 
keeping v fixed. Now, for T < T c , the pressure is given by equation (7), with z replaced by 
unity: 


( 27 ) 

which is proportional to T r>/2 and is independent of v — implying infinite compressibility. 
At the transition point the value of the pressure is 

p{Tc)= (^y 2 (kTc) 5 / 2 t(i) ^ 

7 Equation (6.2.12) gives, for an ideal classical gas, In£2 = zV/ A 3 . Accordingly, N = z(31n<2/3z) =z(V/A 3 ), with the 
result that z = (A 3 /!)). 
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with the help of (24), this can be written as 


P(T C ) = - 

t. 

Thus, the pressure exerted by the particles of an ideal Bose gas at the transition temper¬ 
ature T c is about one-half of that exerted by the particles of an equivalent Boltzmannian 
gas. 8 For T > T c , the pressure is given by 


^("*7 t ) = 0.5134(£tti). (29) 


N g 5/2 ( Z ) 

V g3/2(z)’ 

where z(T) is determined by the implicit relationship 

.t 3 N h 3 

g3/2(Z ) = T = V (2nmkT) 3 / 2 ' 


(30) 


(26a) 


Unless T is very high, the pressure P cannot be expressed in any simpler terms; of course, 
for T T c , the virial expansion (13) can be used. As T —> oo, the pressure approaches the 
classical value NkT/V. All these features are shown in Figure 7.3. The transition line in 
the figure portrays equation (27). The actual (P, T )-curve follows this line from T = 0 up 
to T = T c and thereafter departs, tending asymptotically to the classical limit. It may be 
pointed out that the region to the right of the transition line belongs to the normal phase 
alone, the line itself belongs to the mixed phase, while the region to the left is inaccessible 
to the system. 

In view of the direct relationship between the internal energy of the gas and its pres¬ 
sure, see equation (12), Figure 7.3 depicts equally well the variation of U with T (of course, 
with v fixed). Its slope should, therefore, be a measure of the specific heat CV(T) of the gas. 
We readily observe that the specific heat is vanishingly small at low temperatures and rises 
with T until it reaches a maximum at T — T c ; thereafter, it decreases, tending asymptoti¬ 
cally to the constant classical value. Analytically, for T < T c , we obtain [see equations (15) 
and (27)] 


CV _ 3 V / 5\ d / T\ 
Nk ~ 2 IV \2/ drylv 



V 

I 3 ’ 


(31) 


8 Actually, for all T < T c , we can write 

P(T) = P(T C ) ■ (TIT c pl 2 ~ 0.5134(M e fcr/V). 

We infer that, while particles in the condensed phase do not exert any pressure at all, particles in the excited states are 
about half as effective as in the Boltzmannian case. 
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FIGURE 7.3 The pressure and the internal energy of an ideal Bose gas as a function of the temperature parameter 

(T/T c ). 


which is proportional to T 3 ’-' 2 . At T — T c , we have 

Cy{T c ) _ 15?(§) 
Nk =4,(3) 


1.925, 


(32) 


which is significantly higher than the classical value 1.5. For T > T c , we obtain an implicit 
formula. First of all, 


Cy _ 3 /3^ g5/2(z) \ 
Nk~[dT\2 g3l2(Z))\ v ’ 


(33) 


see equations (11) and (26). To carry out the differentiation, we need to know (dz/dT) v ; 
this can be obtained from equation (26) with the help of the recurrence relation (D.10) in 
Appendix D. On one hand, since gs/ 2 (z) cx T -3 / 2 , 


' d 

dT 


g3/2(Z) 


V 


3 

2 T 


g3/2(zy, 


(34) 


on the other, 


Z — g3/2(z) =gl/ 2 (Z). 


(35) 


Combining these two results, we obtain 


3 g3/2(z) 
2 T g\ /2 (z)' 


3 z 
a t 


( 36 ) 
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Equation (33) now gives 


CV _ 15g5/2( z ) 9g3/2(£). 
Nk~ 4 g 3 / 2 (z) 4 gi/ 2 (z) ’ 


(37) 


the value of z, as a function of T, is again to be determined from equation (26) . In the 
limit z —> 1, the second term in (37) vanishes because of the divergence of g\/ 2 (z), while 
the first term gives exactly the result appearing in (32). The specific heat is, therefore, con¬ 
tinuous at the transition point. Its derivative is, however, discontinuous, the magnitude of 
the discontinuity being 



(38) 


see Problem 7.6. For T > T c , the specific heat decreases steadily toward the limiting value 



(39) 


Figure 7.4 shows all these features of the (Cy, T) -relationship. It may be noted that it 
was the similarity of this curve with the experimental one for liquid He 4 (Figure 7.5) that 
prompted F. London to suggest, in 1938, that the curious phase transition that occurs in 
liquid He 4 at a temperature of about 2.19 K might be a manifestation of the Bose-Einstein 
condensation taking place in the liquid. Indeed, if we substitute, in (24), data for liquid 
He 4 , namely m — 6.65 x 10 -24 g and V = 27.6cm 3 /mole, we obtain for T c a value of about 
3.13K, which is not drastically different from the observed transition temperature of the 
liquid. Moreover, the interpretation of the phase transition in liquid He 4 as Bose-Einstein 
condensation provides a theoretical basis for the two-fluid model of this liquid, which was 
empirically put forward by Tisza (1938a,b) to explain the physical behavior of the liquid 
below the transition temperature. 

According to London, the No particles that occupy a single, entropyless state (e = 0) 
could be identified with the “superfluid component’’ of the liquid and the N e particles 
that occupy the excited states (e ^ 0) with the “normal component.” As required in the 
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FIGURE 7.4 The specific heat of an ideal Bose gas as a function of the temperature parameter (T/T c ). 
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FIGURE 7.5 The specific heat of liquid He 4 under its own vapor pressure (after Keesom and coworkers). 

model of Tisza, the superfluid fraction makes its appearance at the transition tempera¬ 
ture T c , and builds up at the cost of the normal fraction until at T = 0 the whole fluid 
becomes superfluid; compare to Figure 7.1. Of course, the actual temperature dependence 
of these fractions, and of other physical quantities pertaining to liquid He 4 , is consider¬ 
ably different from what the simple-minded ideal Bose gas suggests. London had expected 
that the inclusion of intermolecular interactions would improve the quantitative agree¬ 
ment. Although this expectation has been partially vindicated, there have been other 
advances in the field that provide alternative ways of looking at the helium problem; see 
Section 7.6. Nevertheless, many of the features provided by London’s interpretation of this 
phenomenon continue to be of value. 

Historically, the experimental measurements of the specific heat of liquid He 4 , which 
led to the discovery of this so-called He I-He II transition, were first made by Keesom in 
1927 and 1928. Struck by the shape of the (Cy, T)- curve, Keesom gave this transition the 
name k-transition; as a result, the term transition temperature (or transition point) also 
came to be known as /.-temperature (or /.-point). 

We shall now look at the isotherms of the ideal Bose gas; that is, the variation of the 
pressure of the gas with its volume, keeping T fixed. The Bose-Einstein condensation now 
sets in at a characteristic volume v c , given by 



(40) 


see (23) . We note that v c cx T 3 / 2 . For v < v c , the pressure of the gas is independent of v and 
is given by 



(41) 
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FIGURE 7.6 The isotherms of an ideal Bose gas. 


see (27). The region of the mixed phase in the (P, h)-diagram is marked by a boundary line 
(called the transition line ) given by the equation 


iW /3 = 


h 2 ?(§) 

^i^oT 


const.; 


(42) 


see Figure 7.6. Clearly, the region to the left of this line belongs to the mixed phase, while 
the region to the right belongs to the normal phase alone. 

Finally, we examine the adiabats of the ideal Bose gas. For this, we need an expression 
for the entropy of the system. Making use of the thermodynamic formula 


U —TS + PV = Nfi (43) 

and the expressions for U and P obtained above, we get 

5 g5/2fz) _ lnz for T > T (44a) 

2 g 3 / 2 Cz) 

irHi) forr - Ii: t44b) 

again, the value of z(T), for T > T c , is to be obtained from equation (26). Now, a reversible 
adiabatic process implies the constancy of S and N. For T > T c , this implies the constancy 
of z as well and in turn, by (26), the constancy of (v//?). For T < T c , it again implies the 
same. We thus obtain, quite generally, the following relationship between the volume and 
the temperature of the system when it undergoes a reversible adiabatic process: 

vT 3/2 = const. (45) 

The corresponding relationship between the pressure and the temperature is 


S _U+PV p 
Nk = NkT ~ IcT 


P/T 5/2 = const.; 


(46) 
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see equations (7) and (27) . Eliminating T, we obtain 


p v 5/3 _ conS ( 


(47) 


as the equation for an adiabat of the ideal Bose gas. 

Incidentally, the foregoing results are exactly the same as for an ideal classical gas. There 
is, however, a significant difference between the two cases; that is, while the exponent | in 
formula (47) is identically equal to the ratio of the specific heats Cp and Cv in the case of 
the ideal classical gas, it is not so in the case of the ideal Bose gas. For the latter, this ratio 
is given by 


Cp 

Cv 


1 4Cygi/2(Z) 

9 Nkg 3/2 (z) 

(48a) 

5g5/2(.Z)gl/2(Z)_ 

3 lg3/2(-Z)} 2 ’ 

(48b) 


see Problems 7.4 and 7.5. It is only for T » T c that y ~ |. At any finite temperature, y > | 
and as T -*■ T c , y -»• oo. Equation (47), on the other hand, holds for all T. 

In the mixed-phase region (T < T c ), the entropy of the gas may be written as 


S = N e --k 
2 


f(D 


ocN e ; 


(49) 


see equations (20) and (44b) . As expected, the No particles that constitute the “condensate” 
do not contribute toward the entropy of the system, while the N e particles that constitute 
the normal part contribute an amount of |k£(|)/£(|) per particle. 


7.2 Bose-Einstein condensation in ultracold 
atomic gases 

The first demonstration of Bose-Einstein condensation in ultracold atomic gases came in 
1995. Cornell and Wieman Bose-condensed 87 Rb (Anderson, Ensher, Matthews, Wieman, 
and Cornell (1995)) and Ketterle Bose-condensed 23 Na (Davis, Mewes, Andrews, van 
Druten, Durfee, Kurn, and Ketterle (1995)) using magneto-optical traps (MOTs) and 
magnetic traps to cool vapors of tens of thousands of atoms to temperatures of a few 
nanokelvin. 9 A survey of the theory and experiments can be found in Pitaevskii and 
Stringari (2003), Leggett (2006), and Pethick and Smith (2008). 

The first step of the cooling of the atomic vapor uses three sets of counter-propagating 
laser beams oriented along cartesian axes that are tuned just below the resonant frequency 

9 Since 1995, many isotopes have been Bose-condensed including 7 Li, 23 Na, 41 K, 52 Cr, 84 Sr, 85 Rb, 87 Rb, 133 Cs, and 
174 Yb. The first molecular Bose-Einstein condensates were created in 2003 by the research groups of Rudolf Grimm at the 
University of Innsbruck, Deborah S. Jin at the University of Colorado at Boulder, and Wolfgang Ketterle at Massachusetts 
Institute of Technology. 
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of the atoms in the trap. Atoms that are stationary are just off resonance and so rarely 
absorb a photon. Moving atoms are Doppler shifted on resonance to the laser beam that is 
propagating opposite to the velocity vector of the atom. Those atoms preferentially absorb 
photons from that direction and then reemit in random directions, resulting in a net 
momentum kick opposite to the direction of motion. This results in an “optical molasses” 
that slows the atoms. This cooling method is constrained by the “recoil limit” in which the 
atoms have a minimum momentum of the order of the momentum of the photons used 
to cool the gas. This gives a limiting temperature of ( hf) 2 /2mc 2 k 1 |xK, where / is the 
frequency of the spectral line used for cooling and m is the mass of an atom. 

In the next step of the cooling process, the lasers are turned off and a spatially vary¬ 
ing magnetic field creates an attractive anisotropic harmonic oscillator potential near the 
center of the magnetic trap 


V (r) = ^ m (wfx 2 + + ofz 2 ). (1) 

The frequencies of the trap a> a are controlled by the applied magnetic field. One can then 
lower the trap barrier using a resonant transition to remove the highest energy atoms in 
the trap. If the atoms in the vapor are sufficiently coupled to one other, then the remaining 
atoms in the trap are cooled by evaporation. 

If the interactions between the atoms in the gas can be neglected, the energy of each 
atom in the harmonic oscillator potential is 


s h,I 2 ’h = ha>ih + ha>2h + h + - h(a>\ +a>2 + M 3 ), ( 2 ) 

where l a (= 0,1,2,.. . 00 ) are the quantum numbers of the harmonic oscillator. If the 
three frequencies are all the same, then the quantum degeneracy of a level with energy 
e — hco(l + 3/2) is (7+ l)(7 + 2)/2; see Problem 3.26. 

For the general anisotropic case, the smoothed density of states as a function of energy 
(suppressing the zero point energy and assuming e » hw a ) is given by 


ou 00 CXJ 

m l^h) dhdhdh = 


0 0 0 


2 


(3) 


where &>o = (o>ia | 2 « 3 ) 1/3 ; this assumes a single spin state per atom. The thermodynamic 
potential n, see Appendix H, for bosons in the trap is then given by 


m 4 

2 (hco 0 ) 3 


OO 

/V l„(i- e -v»)* = 

0 


m 4 

(ha>o ) 3 


g4(z), 


n(M,n = 


(4) 
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where z = exp(fin) is the fugacity and g v (z) is dehned in Appendix D. Volume is not a 
parameter in the thermodynamic potential since the atoms are confined by the har¬ 
monic trap. The average number of atoms in the excited states in the trap is 


N(n, T) 



(5) 


For fixed N, the chemical potential monotonically increases as temperature is lowered 
until Bose-Einstein condensation occurs when p = 0 (z = 1). The critical temperature for 
N trapped atoms is then given by 


fi®o U(3)/ 

where £(3) = g 3 (l) ~ 1.202. While the spacing of the energy levels is of order hw o, the crit¬ 
ical temperature for condensation is much larger than the energy spacing of the lowest 
levels for N » 1. A typical magnetic trap oscillation frequency/ ~ 100Hz. For N — 2 x 10 4 , 
as in Cornell and Wieman’s original experiment, lcTc/hcoo ~ 25.5. The observed critical 
temperature was about 170nK (Anderson et al. (1995)). 

ForT < Tc f the number of atoms in the excited states is 


N excited 

m , 

( kT \ 3 

( T \ 

N 

N ' 

\hcL> 0 J 

\Tc) 


so the fraction of atoms that condense into the ground state of the harmonic oscillator is 



( 8 ) 


see de Groot, Hooyman, and ten Seldam (1950), and Bagnato, Pritchard, and Kleppner 
(1987). In the thermodynamic limit, a nonzero fraction of the atoms occupy the ground 
state for T < T c . By contrast, the occupancy of the first excited state is only of order V 1 - /3 , so 
in the thermodynamic limit the occupancy fraction in each excited state is zero. A compar¬ 
ison of the experimentally measured Bose-condensed fraction with equation (8) is shown 
in Figure 7.7. 


7.2.A Detection of the Bose-Einstein condensate 

The linear size of the ground state wavefunction in cartesian direction a is 


O' 



a. 


moj c 


(9) 
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FIGURE 7.7 Experimental measurement of the Bose-condensed fraction vs. temperature, as compared to 
equation (8). The scaled temperature on the horizontal axis is the temperature divided by the TV-dependent critical 
temperature given in equation (6). The inset shows the total number of atoms in the trap after the evaporative 
cooling. From Ensher et al. (1996). Reprinted with permission; copyright © 1996, American Physical 
Society. 

while the linear size of the thermal distribution of the noncondensed atoms in that 
direction is 



( 10 ) 


At trap frequency f— 100 Hz and temperature T = 100 nK, these sizes are about 1pm 
and 5 pm, respectively. Instead of measuring the atoms directly in the trapping potential, 
experimenters usually measure the momentum distribution of the ultracold gas by a time- 
of-flight experiment. At time t — 0, the magnetic field is turned off suddenly, eliminating 
the trapping potential. The atomic cloud then expands according to the momentum dis¬ 
tribution the atoms had in the harmonic trap. The cloud is allowed to expand for about 100 
milliseconds. The speed of the atoms at this temperature is a few millimeters per second, 
so the cloud expands to a few hundred microns in this period of time. The cloud is then 
illuminated with a laser pulse on resonance with the atoms, leaving a shadow on a CCD 
in the image plane of the optics. The size and shape of the light intensity pattern directly 
measures the momentum distribution the atoms had in the trap at t — 0. The expanding 
cloud can be divided into two components, the No atoms that had been Bose-condensed 
into the ground state and the remaining N — No atoms that were in the excited states of the 
harmonic oscillator potential. The Bose-condensed atoms have smaller momenta than the 
atoms that were in the excited states. After time f, the quantum evolution of the ground 
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state has a spatial number density 


n 0 (r,t) = N 0 \^ 0 (r,t)\ 2 = fj 

a=l 




-r% 


( 11 ) 


see Pitaevskii and Stringari (2003), Pethick and Smith (2008), and Problem 7.15. 

The atoms that are not condensed into the ground state can be treated semiclassi- 
cally, that is, the position-momentum distribution function is treated classically while the 
density follows the Bose-Einstein distribution function: 


f(r,p, 0) = 


ex P (Ifr + ("I* 2 +"It 2 + <4z 2 ) - Pu) -1 

After the potential is turned off at t = 0, the distribution evolves ballistically: 


( 12 ) 


f(r,p,t) =/( r+ ~>P>0 ). 


The spatial number density of atoms in the excited states is 

«excited(r,f) = p J f(r+^,p,t^dp, 
which can be integrated to give 


(13) 


(14) 


1 ~ e /*W I - 

«excited( r T) ~ ^2—, j3/2 \ 1 1 


ct=l 




-Pjma>lrl 


(15) 


where X = h/-J'I tt rnlcT is the thermal deBroglie wavelength; see Pethick and Smith (2008), 
and Problem 7.16. The integrals over the condensed state and the excited states correctly 
count all the atoms: 


Nq = J no(r,t)dr, 

N — Nq = / « exc ited( r > t)dr = ' , V exc j [e( -|, 


(16a) 

(16b) 


see Problem 7.18. 

Note that at early times (co a t <<c 1) both the condensed and the excited distributions are 
anisotropic due to the anisotropic trapping potential. However, at late times {a> a , t^> 1), the 
atoms from the excited states form a spherically symmetric cloud because of the isotropic 
momentum dependence of the t = 0 distribution function. By contrast, the atoms that 
were condensed into the ground state expand anisotropically due to the different spa¬ 
tial extents of the ground state wavefunction at t — 0. The direction that has the largest 
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FIGURE 7.8 The two-dimensional time-of-flight number density equations (11) and (15) at late times (a>of 3> 1) for 
T/T c = 0.98 using the experimental parameters in Anderson et al. (1995): N = 2 x 10 4 atoms in the trap and 
oi 2 = y/Sa>i. The plot shows the full density and, underneath, the broader isotropic density just due to the excited 
states. The z-dimension has been integrated out. The Bose-condensed peak is anisotropic: the y-direction spread is 
8 1 / 4 = 1.68 times larger than in the x-direction while the broad peak caused by the excited states is isotropic. The 
distance scale Vq t— t^/huii/m determines the width of the distribution that results from the Bose-condensed peak 
in the x-direction at late times; compare to Figure 7.9. 


w a is quantum mechanically squeezed the most at t = 0; so, according to the uncertainty 
principle, it expands the fastest. This is an important feature of the experimental data that 
confirms the onset of Bose-Einstein condensation; 10 see Figures 7.8 and 7.9. 

7.2.B Thermodynamic properties of the Bose-Einstein condensate 

The temperature, condensate fraction, and internal energy can all be observed using time- 
of-flight measurements. The internal energy can also be written in terms of the function 

gv(z): 


U(n, T) 


CXJ 

/ 


1 


2(ha> 0 ) 3 - 1 


de = 3 


m 4 

(hai 0 y 




(17) 


'“Repulsive interactions between atoms create additional forces that modify the time-of-flight expansion. This is 
especially important in condensates with a very large numbers of atoms, as many as 10 7 or more in some experiments; 
see Section 11.2 A 
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FIGURE 7.9 Time-of-flight images from the first observation of Bose-Einstein condensation in a dilute vapor of 87 Rb 
by Anderson et al. (1995) at temperatures just above and below the phase transition temperature. The anisotropic 
pattern of the Bose-condensed fraction is evident; compare to Figure 7.8. Courtesy of NIST/JILA/University of 
Colorado. 


The heat capacity at constant number can be written as 


Cn(T) = 





(18) 


Equations (5) and (6) can be used to determine the fugacity z numerically, as shown in 
Figure 7.10(a). The fugacity can then be used in equation (17) to obtain the scaled internal 
energy 


U 

NkTc 



T\ 4 g 4 (z) 


for T < T c , 


(19) 


3 


T c ) ?(3) 


for T > T c \ 
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see Figures 7.10(b) and 7.12. The scaled specific heat is given by 


Qv 

Nk 


12?(4) / r\ 3 
?(3) \T C ) 


1 

?(3) 


^2g 4 (z) 


9gf(z)\ 
g 2 (z) ) 


for T < T c , 


for T > T c , 


(20) 


and is shown in Figure 7.11. Unlike the case of Bose-Einstein condensation of free parti¬ 
cles in a box (Figure 7.4), the specific heat of a condensate in a harmonic trap displays a 




FIGURE 7.10 Fugacity (a) and scaled internal energy (b) vs. scaled temperature (T/T c ) for a Bose-Einstein 
condensate in a harmonic trap. 



FIGURE 7.11 Scaled specific heat of a Bose-Einstein condensate in a harmonic trap as a function of the scaled 
temperature (T/T c ); compare with Figure 7.4 for a free-particle Bose gas. 
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discontinuity at the critical temperature: 


Qv 

Nk 


12? (4) 

~ 10.805 

as T - 

± T~ 

?(3) 

* 1 c 

12? (4) 


as T - 

± T+ 

?(3) 

?(2) 

* 1 c 


( 21 ) 


Figure 7.12 shows experimental data for the internal energy of a Bose-Einstein con¬ 
densate of 87 Rb. The break in slope is an indication of the discontinuous specific heat. 
Naturally, in a system with a finite number of particles, all nonanalyticities associated with 
the phase transition are removed. When N is finite, the condensate fraction approaches 
zero smoothly and the discontinuity in the heat capacity is rounded off. Pathria (1998) has 
derived iV-dependent temperature markers that indicate the onset of Bose-Einstein con¬ 
densation in terms of the condensate fraction and the specific heat; see also Kirsten and 
Toms (1996) and Haugerud, Haugest, and Ravndal (1997). 



FIGURE 7.12 Comparison of the experimental measurements of Ensher et al. (1996) (.diamonds) with the 
noninteracting internal energy result — see equation (19) and Figure 7.10(b) — (dotted curve), the zero-order 
solution including interactions (full curve), first-order perturbative treatment (dashed curve), and numerical 
solution (circles). The straight line is the classical Maxwell-Boltzmann result. The inset is an enlargement of 
the region around the critical temperature. The break in slope is an indication of the discontinuity in the 
thermodynamic limit specific heat shown in Figure 7.11; from Minguzzi, Conti, and Tosi (1997). Reprinted 
with permission; copyright © 1997, American Institute of Physics. 
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7.3 Thermodynamics of the blackbody radiation 

One of the most important applications of Bose-Einstein statistics is to investigate the 
equilibrium properties of the blackbody radiation. We consider a radiation cavity of vol¬ 
ume V at temperature T. Historically, this system has been looked on from two, practically 
identical but conceptually different, points of view: 

(i) as an assembly of harmonic oscillators with quantized energies ( n s + \)hco s> where 
n s = 0,1,2,..., and to s is the (angular) frequency of an oscillator, or 

(ii) as a gas of identical and indistinguishable quanta — the so-called photons — the 
energy of a photon (corresponding to the frequency a> s of the radiation mode) 
being fuo s . 

The first point of view is essentially the one adopted by Planck (1900), except that we 
have also included here the zero-point energy of the oscillator; for the thermodynamics 
of the radiation, this energy is of no great consequence and may be dropped altogether. 
The oscillators, being distinguishable from one another (by the very values of &> s ), would 
obey Maxwell-Boltzmann statistics; however, the expression for the single-oscillator par¬ 
tition function Qi(H, T) would be different from the classical expression because now the 
energies accessible to the oscillator are discrete, rather than continuous; compare to equa¬ 
tions (3.8.2) and (3.8.14). The expectation value of the energy of a Planck oscillator of 
frequency w s is then given by equation (3.8.20), excluding the zero-point term \ fko s : 


ha> s 

^ = gfitoj/fcr _ i ' 


( 1 ) 


Now, the number of normal modes of vibration per unit volume of the cavity in the 
frequency range (o>, o> + da>) is given by the Rayleigh expression 


2 • 47r 




o 2 da 


( 2 ) 


where the factor 2 has been included to take into account the duplicity of the transverse 
modes; 11 the symbol c here denotes the speed of light. By equations (1) and (2), the energy 
density associated with the frequency range (o), co + dto) is given by 

H of dm 

u{co)dco - eha]/kT _ 1 > ( 3 ) 

which is Planck’s formula for the distribution of energy over the blackbody spectrum. 
Integrating (3) over all values of w, we obtain the total energy density in the cavity. 

The second point of view originated with Bose (1924) and Einstein (1924, 1925). Bose 
investigated the problem of the “distribution of photons over the various energy levels” 
in the system; however, instead of worrying about the allocation of the various photons 

11 As is well-known, the longitudinal modes play no role in the case of radiation. 
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to the various energy levels (as one would have ordinarily done), he concentrated on the 
statistics of the energy levels themselves! He examined questions such as the “probability 
of an energy level e s (= Hco s ) being occupied by n s photons at a time,’’ “the mean values of 
n s and e s ,” and so on. The statistics of the energy levels is indeed Boltzmannian; the mean 
values of n s and s s , however, turn out to be 

oo / oo 

(n s ) = n s e- nsKo,s/kT / e~ nsKl ° s/kT 
n s = 0 / n s = 0 

1 

=- (4) 

gtuos/kT _i v ' 


and hence 


(e s ) = hco s {n s ) 


hco s 

gfiws / IcT _ 


(5) 


identical with our earlier result (1). To obtain the number of photon states with momenta 
lying between hw/c and h (ca + da>)/c, Bose made use of the connection between this 
number and the “volume of the relevant region of the phase space,” with the result 


g{ai)dco ss 2 • 



V cd 2 dco 


(6) 12 


which is also identical to our earlier result (2). Thus, he finally obtained the distribution 
formula of Planck. It must be noted here that, although emphasis lay elsewhere, the math¬ 
ematical steps that led Bose to his final result went literally parallel to the ones occurring 
in the oscillator approach! 

Einstein, on the other hand, went deeper into the problem and pondered over the 
statistics of both the photons and the energy levels, taken together. He inferred (from 
Bose’s treatment) that the basic fact to keep in mind during the process of distributing 
photons over the various energy levels is that the photons are indistinguishable — a fact 
that had been implicitly taken care of in Bose’s treatment. Einstein’s derivation of the 
desired distribution was essentially the same as given in Section 6.1, with one important 
difference, that is, since the total number of photons in any given volume was indefinite, 
the constraint of a fixed N was no longer present. As a result, the Lagrange multiplier a did 
not enter into the discussion and to that extent the final formula for {n s ) was simpler: 


compare to equation (6.1.18a) or (6.2.22). The foregoing result is identical to (4), with 
e = hco s . The subsequent steps in Einstein’s treatment were the same as in Bose’s. 

12 The factor 2 in this expression arises essentially from the same cause as in the Rayleigh expression (2). However, in 
the present context, it would be more appropriate to regard it as representing the two states of polarization of the photon 
spin. 
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FIGURE 7.13 The spectral distribution of energy in the blackbody radiation. The solid curve represents the 
quantum-theoretical formula of Planck. The long-wavelength approximation of Rayleigh-Jeans and the 
short-wavelength approximation of Wien are also shown. 


Looking back at the two approaches, we note that there is a complete correspondence 
between them — “an oscillator in the eigenstate n s , with energy (n s + \)hco s " in the first 
approach corresponds to “the occupation of the energy level hco s by n s photons” in the 
second approach, “the average energy (e s ) of an oscillator” corresponds to “the mean 
occupation number (n s ) of the corresponding energy level,” and so on. 13 

Figure 7.13 shows a plot of the distribution function (3), which may be written in the 
dimensionless form 

, , x 3 dx 

u(x)dx = -, (8) 

e x -l 


where 


, n 2 h 3 c 3 . Ha> 

u ( x) = u(x) and x=—. (9) 

For long wavelengths (jc <§; 1), formula (8) reduces to the classical approximation of 
Rayleigh (1900) and leans (1905), namely 14 

( 10 ) 

while for short wavelengths (x » 1), it reduces to the rival formula of Wien (1896), namely 

u'(x) - r 3 e~L (11) 


13 Compared to the standard Bose-Einstein result (7.1.2), formula (7) suggests that we are dealing here with a case for 
which z is precisely equal to unity. It is not difficult to see that this is due to the fact that the total number of particles in 
the present case is indefinite. For then, their equilibrium number N has to be determined by the condition that the free 
energy of the system is at its minimum, that is, {(dA/dN) N= fj} v,t — 0, which, by definition, implies that [x. = 0 and hence 
z= 1 . 

14 The Rayleigh-Jeans formula follows directly if we use for ( e s ) the equipartition value kT rather than the quantum- 
theoretical value (1). 
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For comparison, the limiting forms (10) and (11) are also included in the figure. We note 
that the areas under the Planck curve and the Wien curve are jt 4 /15(~ 6.49) and 6, respec¬ 
tively. The Rayleigh-Jeans curve, however, suffers from a high-frequency catastrophe! 

For the total energy density in the cavity, we obtain from equations (8) and (9) 


U 

V 


OO 

J u(x)dx = 
0 


( kT ) 4 
7r 2 fi 3 c 3 


/ 


x 3 dx 

e*-\ 


o 


7T 2 fc 4 

15fi 3 c 3 


rp 4 


( 12) 15 


If there is a small opening in the walls of the cavity, the photons will “effuse” through 
it. The net rate of flow of the radiation, per unit area of the opening, will be given by, see 
equation (6.4.12), 


1 U _ n 2 k 4 
4 V 60fi 3 c 2 


T 4 = oT \ 


(13) 


where 


2 k 4 


60fi 3 c 2 


= 5.670 x 10“ 8 Wm“ 2 KT 4 . 


(14) 


Equation (13) describes the Stefan-Boltzmann law of blackbody radiation, a being the Ste¬ 
fan constant. This law was deduced from experimental observations by Stefan in 1879; five 
years later, Boltzmann derived it from thermodynamic considerations. 

For further study of thermodynamics, we evaluate the grand partition function of the 
photon gas. Using equation (6.2.17) with z — 1, we obtain 


PV 


\n&(V, T) = — = — ^ln(l — 


„-e/kT 


)• 


(15) 


Replacing summation by integration and making use of the extreme relativistic formula 


a(e)de = 2V 


4np 2 dp 8irV 2 


h 3 


h 3 c 3 


e 2 d£, 


(16) 


we obtain, after an integration by parts, 


, PV 8nV 1 

In &(V t T)= — = 


kT 3 h 3 c 3 kT J e E / kT - 1' 


CXJ 

/; 


e 3 de 


15 Here, use has been made of the fact that the value of the definite integral is (4) = tt 4 /15; see Appendix D. 
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By a change of variable, this becomes 


PV = 


8nV 
3 h 3 c 3 




8tt 5 V 
45 h 3 c 3 


( kT ) 4 = -U. 
3 


(17) 


We thus obtain the well-known result of the radiation theory; that is, the pressure of the 
radiation is equal to one-third its energy density; see also equations (6.4.3) and (6.4.4). 
Next, since the chemical potential of the system is zero, the Helmholtz free energy is equal 
to —PV\ therefore 


whereby 


and 


A = —PV = — U, 
3 


„ U-A 4U T o 

S =-=-cx VT 3 

T 3 T 


^- r (SX- 3s - 


(18) 


(19) 


( 20 ) 


If the radiation undergoes a reversible adiabatic change, the law governing the variation of 
T with V would be, see (19), 


VT 3 = const. (21) 

Combining (21) with the fact that P cx T 4 , we obtain an equation for the adiabats of the 
system, namely 


py 4 / 3 _ const. 


( 22 ) 


It should be noted, however, that the ratio Cp/Cv of the photon gas is not 4/3; it is infinite! 

Finally, we derive an expression for the equilibrium number N of photons in the 
radiation cavity. We obtain 


N = 


tx, 

/ 


V 


2 q 3 J ghco/kT _ y 


= V 


2i;(3){kT) 3 , 

s Y. „ ’ oc VT 3 . 


r 2 h 3 c 3 


(23) 


Instructive though it may be, formula (23) cannot be taken at its face value because in the 
present problem the magnitude of the fluctuations in the variable N, which is determined 
by the quantity (3P/3 V) -1 , is infinitely large; see equation (4.5.7). 
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One of the most important examples of blackbody radiation is the 2.7 K cosmic 
microwave background, which is a remnant from the Big Bang. Equations (12) and (23) 
play an important role in our understanding of the thermodynamics of the early universe; 
see Problem 7.24 and Chapter 9. 


7.4 The field of sound waves 

A problem mathematically similar to the one discussed in Section 7.3 arises from the 
vibrational modes of a macroscopic body, specifically a solid. As in the case of black- 
body radiation, the problem of the vibrational modes of a solid can be studied equally 
well by regarding the system as a collection of harmonic oscillators or by regarding it as 
an enclosed region containing a gas of sound quanta — the so-called phonons. To illus¬ 
trate this point, we consider the Hamiltonian of a classical solid composed of N atoms 
whose positions in space are specified by the coordinates (xi,X 2 ,...,X 3 n). In the state of 
lowest energy, the values of these coordinates maybe denoted by (xi,X 2 ,..., x :i /v). Denoting 
the displacements (x ; — x/) of the atoms from their equilibrium positions by the variables 
%i(i — 1,2,..., 3 AO, the kinetic energy of the system in configuration (xfi is given by 

3 N 3iV 

K =- m Y j X^=- m ^f, ( 1 ) 

1=1 1=1 


and the potential energy by 

4> = $(X;) = *(X ! -) + y'f^) (Xi-Xi) 

j (Xt-Xi)(Xj-Xj)- (2) 

/ (*;)=(*,■) 


V- 1 


a 2 <t> 

2 \ dXidXi 


The main term in this expansion represents the (minimum) energy of the solid when all 
the atoms are at rest at their mean positions x,; this energy may be denoted by the symbol 
4>o. The next set of terms is identically zero because the function <t> (x;) has a minimum 
at (Xj) = (xp and hence all its first derivatives vanish there. The second-order terms of the 
expansion represent the harmonic component of the vibrations of the atoms about their 
mean positions. If we assume that the overall amplitude of these vibrations is not large we 
may retain only the harmonic terms of the expansion and neglect all successive ones; we 
are then working in the so-called harmonic approximation. Thus, we may write 
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where 



d 2 <t> \ 

dXidXj ) 

] / (Xi)=(Xi) 


(4) 


We now introduce a linear transformation, from the coordinates ^ to the so-called normal 
coordinates q t , and choose the transformation matrix such that the new expression for the 
Hamiltonian does not contain any cross terms, that is, 


H = 



(5) 


where a>i(i — 1,2,..., 3 N) are the characteristic frequencies of the normal modes of the sys¬ 
tem and are determined essentially by the quantities aq or, in turn, by the nature of the 
potential energy function 0(xd- Equation (5) suggests that the energy of the solid, over 
and above the (minimum) value 5> 0 , may be considered as arising from a set of 3N one¬ 
dimensional, noninteracting, harmonic oscillators whose characteristic frequencies are 
determined by the interatomic interactions in the system. 

Classically, each of the 3 N normal modes of vibration corresponds to a wave of distor¬ 
tion of the lattice, that is, a sound wave. Quantum-mechanically, these modes give rise to 
quanta, called phonons, in much the same way as the vibrational modes of the electromag¬ 
netic field give rise to photons. There is one important difference, however, that is, while 
the number of normal modes in the case of an electromagnetic held is infinite, the num¬ 
ber of normal modes (or the number of phonon energy levels) in the case of a solid is fixed 
by the number of lattice sites. 16 This introduces certain differences in the thermodynamic 
behavior of the sound held in contrast to the thermodynamic behavior of the radiation 
held; however, at low temperatures, where the high-frequency modes of the solid are not 
very likely to be excited, these differences become rather insignihcant and we obtain a 
striking similarity between the two sets of results. 

The thermodynamics of the solid can now be studied along the lines of Section 3.8. First 
of all, we note that the eigenvalues of the Hamiltonian (5) are 


E{ni} = <D 0 + J2 ( ni+ T ) ho,i ’ 


( 6 ) 


where the numbers «,■ denote the “states of excitation’’ of the various oscillators (or, equally 
well, the occupation numbers of the various phonon levels in the system). The internal 
energy of the system is then given by 

U(T) = { <t> 0 + £ \fm J + £ (7) 


16 Of course, the number of phonons themselves is indefinite. As a result, the chemical potential of the phonon gas is 
also zero. 
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The expression within the curly brackets gives the energy of the solid at absolute zero. The 
term <J>o is negative and larger in magnitude than the total zero-point energy, J2i \bwi, of 
the oscillators: together, they determine the binding energy of the lattice. The last term 
in (7) represents the temperature-dependent part of the energy, 17 which determines the 
specific heat of the solid: 


C V (T) = 



(hcL>i/kT) 2 e na>i i kT 

(, ghiOj/kT _ 1)2 


( 8 ) 


To proceed further, we need to know the frequency spectrum of the solid. To obtain 
this from first principles is not an easy task. Accordingly, one obtains this spectrum either 
through experiment or by making certain plausible assumptions about it. Einstein, who 
was the first to apply the quantum concept to the theory of solids (1907), assumed, for 
simplicity, that the frequencies a>i are all equal. Denoting this (common) value by we, the 
specific heat of the solid is given by 


C V (T) = 3NkE(x), 

where E(x) is the so-called Einstein function-. 


E(x) = 


x 2 e x 

(e x - 1) 2 ’ 


(9) 


( 10 ) 


with 


x — Ho) E /kT = & e /T. (11) 

The dashed curve in Figure 7.14 depicts the variation of the specific heat with tempera¬ 
ture, as given by the Einstein formula (9). At sufficiently high temperatures, where T » &e 
and hence x«l, the Einstein result tends toward the classical one, namely 3/V/c. 1 8 At 
sufficiently low temperatures, where T <§; &e and hence x » 1, the specific heat falls expo¬ 
nentially fast and tends to zero as T ->• 0. The theoretical rate of fall, however, turns out to 
be too fast in comparison with the observed one. Nevertheless, Einstein’s approach did at 
least provide a theoretical basis for understanding the observed departure of the specific 
heat of solids from the classical law of Dulong and Petit, whereby CV — 3R — 5.96 calories 
per mole per degree. 

Debye (1912), on the other hand, allowed a continuous spectrum of frequencies, cut 
off at an upper limit cod such that the total number of normal modes of vibration is 3N, 

17 The thermal energy of the solid may well be written as where («,-){= (e h '"^ kT — l) -1 ) is the mean 

occupation number of the phonon levelClearly, the phonons, like photons, obey Bose-Einstein statistics, with p = 0. 

18 Actually, when the temperature is high enough, so that all (han/kT) <SC 1, the general formula (8) itself reduces to the 
classical one. This corresponds to the situation when each of the 3 N modes of vibration possesses a thermal energy kT. 
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FIGURE 7.14 The specific heat of a solid, according to the Einstein model:-, and according to the Debye 

model: —. The circles denote the experimental results for copper. 


that is 


cod 

J g(co)dco = 3 N, ( 12 ) 

0 

where g(m)dm denotes the number of normal modes of vibration whose frequency lies in 
the range (m,m + dm). For g(m), Debye adopted the Rayleigh expression (7.3.2), modified so 
as to suit the problem under study. Writing ci for the velocity of propagation of the longi¬ 
tudinal modes and c.j for that of the transverse modes (and noting that, for any frequency 
m, the transverse mode is doubly degenerate), equation (12) becomes 



(13) 


from which one obtains for the cutoff frequency 



Accordingly, the Debye spectrum may be written as 


g(m) = 


9 N , 
m D 

0 


for co < cojj, 
for co > coj). 


(14) 


(15) 


Before we proceed further to calculate the specific heat of solids on the basis of the 
Debye spectrum, two remarks seem to be in order. First, the Debye spectrum is only an 
idealization of the actual situation obtaining in a solid; it may, for instance, be compared 
with a typical spectrum such as the one shown in Figure 7.15. While for low-frequency 
modes (the so-called acoustic modes) the Debye approximation is reasonable, serious dis¬ 
crepancies are seen in the case of high-frequency modes (the so-called optical modes). At 
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FIGURE 7.15 The normal-mode frequency distribution g(oi ) for aluminum. The solid curve is derived from 
x-ray scattering measurements [Walker (1956)] while the dashed curve represents the corresponding Debye 
approximation. 

any rate, for “averaged” quantities, such as the specific heat, the finer details of the spec¬ 
trum are not very important. Second, the longitudinal and the transverse modes of the 
solid should have their own cutoff frequencies, cod,l and cod.t say, rather than a common 
cutoff at cod, for the simple reason that, of the 3 N normal modes of the lattice, exactly N 
are longitudinal and 2N transverse. Accordingly, we should have, instead of (13), 



We note that the two cutoffs correspond to a common wavelength X min {= (AnV/?,N) } 
which is comparable to the mean interatomic distance in the solid. This is quite reasonable 
because, for wavelengths shorter than , it would be rather meaningless to speak of a 
wave of atomic displacements. 

In the Debye approximation, formula (8) gives 


C V (T) = 3NkD(x 0 ), 


(17) 


where D(x o) is the so-called Debye function: 



(18) 


with 


ha>D ®D 


(19) 


A(> kJ , j , 
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©d being the so-called Debye temperature of the solid. Integrating by parts, the expression 
for the Debye function becomes 


D(x 0 ) = - 


3x 0 
e x o — 


x o 

12 r x 3 dx 
1 + e* —l' 


( 20 ) 


For T^> &£>, which means x 0 1, the function D(x 0 ) may be expressed as a power series 
inxo: 


D(x o) = 1 - 



( 21 ) 


Thus, as T —> oo, Cy 3Me; moreover, according to this theory, the classical result should 
be applicable to within \ percent so long as T > 3©d- For T <<c ©£>, which means xo » 1, 
the function D(x o) may be written as 


0° 

^ 12 f x 5 dx ^ Yn 

(X ° )= 4J e^l + ° (e h 
u o 

_ 4;r 4 _ 4 t r 4 / T \ 3 
5xjj 5 V ©d / 

Thus, at low temperatures the specific heat of the solid obeys the Debye T 3 -law: 

1 277-4 / 7" \ 3 / T 1 \3 

Cy =- Nkl -) = 464.41 -) cal mole” 1 ^ 1 . 

5 \@ D J \& D J 


( 22 ) 


(23) 


It is clear from equation (23) that a measurement of the low-temperature specific heat of 
a solid should enable us not only to check the validity of the T 3 -law but also to obtain an 
empirical value of the Debye temperature ©/> 19 The value of 0 o can also be obtained by 
computing the cutoff frequency cod from a knowledge of the parameters N/V,cl and Cj', 
see equations (14) and (19). The closeness of these estimates is further evidence in favor 
of Debye’s theory. Once ®d is known, the whole temperature range can be covered theo¬ 
retically by making use of the tabulated values of the function D(x o). 20 A typical case was 
shown earlier in Figure 7.14. We saw that not only was the T 3 -law obeyed at low temper¬ 
atures, but also the agreement between theory and experiment was good throughout the 
range of observations. 

19 It can be shown that, according to this theory, deviations from the r 3 -law should not exceed 2 percent so long as 
T < 0 D /1O. However, in the case of metals, one cannot expect to reach a true r 3 -region because, well before that, the 
specific heat of the electron gas might become a dominant contribution (see Section 8.3); unless the two contributions 
are separated out, one is likely to obtain a somewhat suppressed value of 0 d from these observations. 

20 See, for example, Fowler and Guggenheim (1960, p. 144). 
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As another illustration of agreement in the low-temperature regime, we include here 
another plot, Figure 7.16, which is based on data obtained with the KC1 crystal at temper¬ 
atures below 5 K; see Keesom and Pearlman (1953). Here, the observed values of Cv/T are 
plotted against T 2 . It is evident that the data fall quite well on a straight fine from whose 
slope the value of ©d can be determined. One thus obtains, for KC1, &d = 233 ± 3 K, which 
is in reasonable agreement with the values of 230 to 246 K coming from various estimates 
of the relevant elastic constants. 

In Table 7.1 we fist the values of ©£> for several crystals, as derived from the specific heat 
measurements and from the values of the elastic constants. 

In general, if the specific heat measurements of a given system conform to a T 3 -law, one 
may infer that the thermal excitations in the system are accounted for solely by phonons. 
We expect something similar to happen in liquids as well, with two important differences. 
First, since liquids cannot withstand shear stress they cannot sustain transverse modes of 
vibration; a liquid composed of N atoms will, therefore, have only N longitudinal modes 
of vibration. Second, the normal modes of a liquid cannot be expected to be strictly har¬ 
monic; consequently, in addition to phonons, we might have other types of excitation 
such as vortex flow or turbulence (or even a modified kind of excitation, such as rotons 
in liquid He 4 ). 



FIGURE 7.16 A plot of (Cv/T) versus T 2 for KCI, showing the validity of the Debye r 3 -law. The experimental points 
are from Keesom and Pearlman (1953). 


Table 7.1 The Values of the Debye Temperature ®d for Different Crystals 


Crystal 

Pb 

Ag 

Zn 

Cu 

Al 

C 

NaCI 

KCI 

MgO 

0 j 5 from the specific 

88 

215 

308 

345 

398 

-1850 

308 

233 

-850 

heat measurements 

from the elastic 

73 

214 

305 

332 

402 


320 

240 

-950 


constants 
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Now, helium is the only substance that remains liquid at temperatures low enough to 
exhibit the T 3 -behavior. In the case of the lighter isotope, He 3 , the results are strongly influ¬ 
enced by the Fermi-Dirac statistics; as a result, a specific heat proportional to the first 
power of T dominates the scene (see Section 8.1). In the case of the heavier isotope, He 4 , 
the low-temperature situation is completely governed by phonons; accordingly, we expect 
its specific heat to be given by, see equations (16) and (23), 


cy = 



where 


67t 2 N\ 

—) 6 


(24) 


(25) 


c being the velocity of sound in the liquid. The specific heat per unit mass of the liquid is 
then given by 


c v = 


2n 2 k 4 
15 pfi 3 c 3 


r 3 , 


(26) 


where p is the mass density. Substituting p = 0.1455g/cm 3 and c = 238 m/s, the foregoing 
result becomes 


Cy = 0.0209T 3 jouleg _1 K _1 . (27) 

The experimental measurements of Wiebes et al. (1957), for 0 < T < 0.6K, conformed to 
the expression 


Cy = (0.0204 ± 0.0004) T 3 joule g~ 1 K“ 1 . (28) 

The agreement between the theoretical result and the experimental observations is clearly 
good. 


7.5 Inertial density of the sound field 

For further understanding of the low-temperature behavior of liquid He 4 , we determine 
the “inertial mass” associated with a gas of sound quanta in thermal equilibrium. For 
this, we consider “a phonon gas in mass motion,” for then by determining the relation¬ 
ship between the momentum P of the gas and the velocity v of its mass motion we can 
readily evaluate the property in question. Now, since the total number of phonons in the 
system is indefinite, the problem is free from the constraint of a fixed N; consequently, 
the undetermined multiplier a may be taken to be identically zero. However, we now have 
a new constraint on the system, namely that of a fixed total momentum P, additional to 
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the constraint of the fixed total energy E. Under these constraints, the mean occupation 
number of the phonon level s(p) would be 


(n(p)) = 


1 

expels + y ■ p) - 1' 


(1) 


As usual, the parameter ft is equal to 1/kT. To determine y, it seems natural to evaluate 
the drift velocity of the gas. Choosing the z-axis in the direction of the mass motion, the 
magnitude v of the drift velocity will be given by “the mean value of the component u z of 
the individual phonon velocities”: 


V = { ucosO). 


( 2 ) 


Now, for phonons 


e = pc and u= — = c, (3) 

dp 

where c is the velocity of sound in the medium. Moreover, by reasons of symmetry, we 
expect the undetermined vector y to be either parallel or antiparallel to the direction of 
mass motion; hence, we may write 


y p = y z pz = y z pcos9. 

In view of equations (1), (3), and (4), equation (2) becomes 

Io° Jo [ ex p{/Spc(l + (y z //3c) cos9)} - l]~ * 1 (ccos9)(p 2 dp2n sin 9d 9) 

fo° fo [ ex P(/0P c (l + (y z /Pc) case)} - \]-Hp 2 dp2n sin9d9) 

Making the substitutions 

COS(9 = rj, p(l + (y z /Pc)r)) = p' 


and cancelling away the integrations over p ', we obtain 

_ /^i(l + (y Z /Pc)rj)- 3 ridr) _ 
/il(l + (y Z /Pc)ri)- 3 dr) 

It follows that 


(4) 


(5) 


y = -pv. 

Accordingly, the expression for the mean occupation number becomes 

1 

{ n(p )) =-. 

p exp{/i(e — v ■ p)} — 1 


( 6 ) 


( 7 ) 
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A comparison of (7) with the corresponding result in the rest frame of the gas, namely 

<»0(P 0 )) = - 7W— -T< (8) 

expijSeo) - 1 

shows that the change caused by the imposition of mass motion on the system is noth¬ 
ing but a straightforward manifestation of the Galilean transformation between the two 
frames of reference. 

Alternatively, equation (7) maybe written as 


exp(f)p'c) — 1 exp{/3pc(l — (v/c)cos6)} — 1 ' 

As such, formula (9) lays down a serious restriction on the drift velocity v, that is, it must 
not exceed c, the velocity of the phonons, for otherwise some of the occupation num¬ 
bers would become negative! Actually, as our subsequent analysis will show, the formalism 
developed in this section breaks down as v approaches c. The velocity c may, therefore, be 
regarded as the critical velocity for the flow of the phonon gas: 


(Z'c)ph = c. (10) 

The relevance of this result to the problem of superfluidity in liquid helium II will be seen 
in the following section. 

Next we now calculate the total momentum P of the phonon gas: 

P = ’52WP))P- cm 

p 

Indeed, the vector P will be parallel to the vector v, the latter being already in the direction 
of the z-axis. We have, therefore, to calculate only the z-component of the momentum: 


P = P Z = ^2{n(p))Pz 

p 

p cos0 / Vp 2 dp2n sindd9 

exp{fipc(l - (i//c)cos#)} - 1 \ h 3 

oo x 



OO 

2tcV r p' 3 dp’ 

h 3 J exp(Pp'c) - 1 
o 


n 

J {1 — (v/c)cosd}~ 4 cosO sinOdd 
o 


16tt 5 v/c 2 

V 45h 3 c 3 /3 4 ' (1 - v 2 /c 2 ) 3 ' 


( 12 ) 
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The total energy E of the gas is given by 
e = j2( n( ~p))p c 


2nVc 

h 3 


CXJ Jl 

f -- f {1 — (v/c)cos9}~ 4 sinddO 

J expire) - 1 J 


= V 


o 

4tt 5 


1 + lv 2 /c 2 


15 h 3 c 3 ^ (1 -v 2 /c 2 ) 3 ' 


(13) 


It is now natural to regard the ratio P/v as the “inertial mass” of the phonon gas. The 
corresponding mass density p is, therefore, given by 




i67r 5 /c 4 r 4 


vV 45 h 3 c 5 (1 — v 2 /c 2 ) 3 

For ( v/c ) «; 1, which is generally true, the mass density of the phonon gas is given by 

16?r 5 fc 4 4 4 

(P °) ph “ 45f^ “ 3 ^ (Eo/V) - 


(14) 


(15) 


Substituting the value of c for liquid He 4 at low temperatures, the phonon mass density, as 
a fraction of the actual density of the liquid, is given by 


(A))ph/PHe — 1.22 x 10~ 4 T 4 ; (16) 

thus, for example, at T = 0.3 K the value of this fraction turns out to be about 9.9 x 10 7 . 
Now, at a temperature like 0.3 K, phonons are the only excitations in liquid He 4 that need to 
be considered; the calculated result should, therefore, correspond to the “ratio of the den¬ 
sity p n of the normal fluid in the liquid to the total density p of the liquid.” It is practically 
impossible to make a direct determination of a fraction as small as that; however, indirect 
evaluations that make use of other experimentally viable properties of the liquid provide a 
striking confirmation of the foregoing result; see Figure 7.17. 


7.6 Elementary excitations in liquid helium II 

Landau (1941, 1947) developed a simple theoretical scheme that explains reasonably well 
the behavior of liquid helium II at low temperatures not too close to the X -point. Accord¬ 
ing to this scheme, the liquid is treated as a weakly excited quantum-mechanical system, 
in which deviations from the ground state (T = 0 K) are described in terms of “a gas of ele¬ 
mentary excitations” hovering over a quiescent background. The gas of excitations corre¬ 
sponds to the “normal fluid,” while the quiescent background represents the “superfluid.” 
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FIGURE 7.17 The normal fraction (p n /p), as obtained from experimental data on (i) the velocity of second sound 
and (ii) the entropy of liquid He II (after de Klerk, Hudson, and Pellam, 1953). 


At T = OK, there are no excitations at all (p n = 0) and the whole of the fluid constitutes the 
superfluid background (p s = pne)- At higher temperatures, we may write 

Ps(T) = PHe(T) ~ Pn(T), (1) 

so that at T — T,_, p n = p H e and p s = 0. At T > T ,, the liquid behaves in all respects as a 
normal fluid, commonly known as liquid helium I. 

Guided by purely empirical considerations, Landau also proposed an energy- 
momentum relationship e(p) for the elementary excitations in liquid helium II. At 
low momenta, the relationship between e and p was linear (which is characteristic of 
phonons), while at higher momenta it exhibited a nonmonotonic character. The excita¬ 
tions were assumed to be bosons and, at low temperatures (when their number is not very 
large), mutually noninteracting; the macroscopic properties of the liquid could then be 
calculated by following a straightforward statistical-mechanical approach. It was found 
that Landau’s theory could explain quite successfully the observed properties of liquid 
helium II over a temperature range of about 0 to 2 K; however, it still remained to be ver¬ 
ified that the actual excitations in the liquid did, in fact, conform to the proposed energy 
spectrum. 

Following a suggestion by Cohen and Feynman (1957), a number of experimental work¬ 
ers set out to investigate the spectrum of excitations in liquid helium II by scattering 
long-wavelength neutrons (A. > 4A) from the liquid. At temperatures below 2 K, the most 
important scattering process is the one in which a neutron creates a single excitation in the 
liquid. By measuring the modified wavelength kp of the neutrons scattered at an angle </>, 
the energy e and the momentum p of the excitation created in the scattering process could 
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be determined on the basis of the relevant conservation laws: 


s = h 2 (XT 2 -Xj 2 )/2m, (2) 

p 2 = h 2 (X/ 2 + XJ 2 — 2X/ 1 XJ 1 c os<p), (3) 

where A., is the initial wavelength of the neutrons and m the neutron mass. By varying <p, or 
Xj, one could map the entire spectrum of the excitations. 

The first exhaustive investigation along these lines was carried out by Yarnell et al. 
(1959); their results, shown in Figure 7.18, possess a striking resemblance to the empiri¬ 
cal spectrum proposed by Landau. The more important features of the spectrum, which 
was obtained at a temperature of 1.1 K, are the following: 

(i) If we fit a linear, phonon-like spectrum (e = pc) to points in the vicinity of 
p/h = 0.55A , we obtain for c a value of (239 ± 5) m/s, which is in excellent 
agreement with the measured value of the velocity of sound in the liquid, namely 
about 238 m/s. 

(ii) The spectrum passes through a maximum value of s/k = (13.92 ± 0.10) K at 

p//t=(l.ll±0.02)A _1 . 

(iii) This is followed by a minimum at p/h = (1.92 ± 0.01)A , whose neighborhood may 
be represented by Landau’s roton spectrum: 


s(p ) = A + 


(P~Po) 2 

2p 


(4) 



FIGURE 7.18 The energy spectrum of the elementary excitations in liquid He II at 1.1 K [after Yarnell et al. (1959)]; 
the dashed line emanating from the origin has a slope corresponding to the velocity of sound in the liquid, namely 
(239 ± 5) m/s. 
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with 


A//c= (8.65 ± 0.04)K, 

p 0 /h= (1.92±0.01)A _1 , (5) 21 


and 


/x = (0.16 ± 0.01)m He . 

(iv) Above p/h ~ 2.18 A 1 , the spectrum rises linearly, again with a slope equal to c. Data 
were also obtained at temperatures 1.6 K and 1.8K. The spectrum was found to be of 
the same general shape as at 1.1 K; only the value of A was slightly lower. 

In a later investigation, Henshaw and Woods (1961) extended the range of observation 
at both ends of the spectrum; their results are shown in Figure 7.19. On the lower side, 
they carried out measurements down to p/h — 0.26A 1 and found that the experimental 
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FIGURE 7.19 The energy spectrum of the elementary excitations in liquid He II at 1.12K (after Henshaw and Woods, 
1961); the dashed straight lines have a common slope corresponding to the velocity of sound in the liquid, namely 
237m/s. The parabolic curve rising from the origin represents the energy spectrum, e(p) = p 2 /2m, of free helium 
atoms. 

21 The term “roton" for these excitations was coined by Landau who had originally thought that these excitations 
might, in some way, represent local disturbances of a rotational character in the liquid. However, subsequent theoretical 
work, especially that of Feynman (1953, 1954) and of Brueckner and Sawada (1957), did not support this contention. 
Nevertheless, the term “roton” has remained. 
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points indeed lie on a straight line (of slope 237 m/s). On the upper side, they pushed their 
measurements up to p/h — 2.68A 1 and found that, after passing through a minimum at 
1.91 A , the curve rises with an increasing slope up to about 2.4 A 1 at which point the 
second derivative d 2 s/dp 2 changes sign; the subsequent trend of the curve suggests the 
possible existence of a second maximum in the spectrum! 22 

To evaluate the thermodynamics of liquid helium II, we first of all note that at suf¬ 
ficiently low temperatures we have only low-lying excitations, namely the phonons. 
The thermodynamic behavior of the liquid is then governed by formulae derived in 
Sections 7.4 and 7.5. At temperatures higher than about 0.5 K, the second group of exci¬ 
tations, namely the rotons (with momenta in the vicinity of po ), also shows up. Between 
0.5 K and about 1K, the behavior of the liquid is governed by phonons and rotons together. 
Above 1K, however, the phonon contributions to the various thermodynamic properties 
of the liquid become rather unimportant; then, rotons are the only excitations that need 
to be considered. 

We shall now study the temperature dependence of the roton contributions to the 
various thermodynamic properties of the liquid. In view of the continuity of the energy 
spectrum, it is natural to expect that, like phonons, rotons also obey Bose-Einstein statis¬ 
tics. Moreover, their total number N in the system is quite indefinite; consequently, their 
chemical potential p is identically zero. We then have for the mean occupation numbers 
of the rotons 


(n(p)) = 


1 

exp{^e(p)}- 1’ 


( 6 ) 


where s(p) is given by equations (4) and (5). Now, at all temperatures of interest (namely 
T < 2K), the minimum value of the term exp{/3e(p)}, namely exp(A//cT), is considerably 
larger than unity. We may, therefore, write 


( n(p )) ~ exp{-jSe(p)). (7) 

The (-/-potential of the system of rotons is, therefore, given by 

py _ _ _ 

q(V, T) = — = - "Y InU - exp{-/?s(p)}] ~ exp {~Pe(p)} ~ N, (8) 

P P 

where N is the “equilibrium” number of rotons in the system. The summation over p may 
be replaced by integration, with the result 


OO 



0 


A+ 


(p-po> 2 

w 



( 4np 2 dp ). 


(9) 


22 This seems to confirm a remarkable prediction by Pitaevskii (1959) that an end point in the spectrum might occur 
at a "critical” value p c of the excitation momentum where s c is equal to 2 A and (3e/3p) c is zero. 
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Substituting p = p 0 + (2pkT) 1/2 x, we get 

py = _ = ^^ e -A/kT {2llkT) i/21 e -x> j! + ^ x | dx (10) 

The “relevant” range of the variable x that makes a significant contribution toward this 
integral is fairly symmetric about the value x — 0; consequently, the net effect of the linear 
term in the integrand is vanishingly small. The quadratic term too is unimportant because 
its coefficient (2 pkT)/p^ <£; 1. Thus, all we have to consider is the integral of exp(—x 2 ). Now, 
one can readily verify that the limits of this integral are such that, without seriously affect¬ 
ing the value of the integral, they may be taken as — oo and +oo; the value of the integral is 
then simply 7T 1 / 2 . We thus obtain 

^ = N = 4U ^ V (2n pkT) l/2 e~ A/kT . (II ) 23 

The free energy of the roton gas is given by (since p = 0) 

A = -PV = —NkT oc T 3/2 e~ A/kT , (12) 


which gives 


= a+ts = n( A + ^fcr 


= Nk 


3 ,A, 

2 kT\’ 


(13) 

(14) 24 


and 


Cy = 



3 A 

4 + kT + 



(15) 


Clearly, as T -»• 0, all these results tend to zero (essentially exponentially). 

We now determine the inertial mass density of the roton gas. Proceeding as in 
Section 7.5, we obtain for a gas of excitations with energy spectrum e(p) 


Mo 1 [ d 3 p 

po = = hrn / n(s-v-p)p—^-, 

V v^ov J h 6 


(16) 


23 Looking back at integral (9), what we have done here amounts to replacing p 2 in the integrand by its mean value p% 
and then carrying out integration over the “complete" range of the variable (p - po). 

24 This result is highly suggestive of the fact that for rotons there is only one true degree of freedom, namely the 
magnitude of the roton momentum, that is thermally effective! 
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where n(e -vp) is the mean occupation number of the state s(p), as observed in a frame 
of reference K with respect to which the gas is in mass motion with a drift velocity v. 25 
For small v, the function n(s — v p) may be expanded as a Taylor series in v and only the 
terms n(e) — (v ■ p)dn{s)/ds retained. The integral over the first part denotes the momen¬ 
tum density of the system, as observed in the rest frame Kq, and is identically zero. We are 
thus left with 


po = — -pr f p 2 cos 2 o'^—^ ] -(p 2 dp2n sinOdd) 
ri' J 3 s 


OO 



0 


(17) 


which holds for any energy spectrum and for any statistics. 
For phonons, we obtain 


4n f 

(P0)ph = _ 3 WcJ 


An C dn(p ) 4 , 

1 p dp 


dp 


An 
3 Wc 


n(p) ■ p 


1 f o 

— / n(p) ■ 4p dp 


CXJ 

f‘ 


= ^2 / n W-P C 


Anp 2 dp\ A 


h 3 


= ^2( £ 0)ph/^. 


(18) 


which is identical to our earlier result (7.5.15). 

For rotons, n(s) ~ exp(-j6£); hence, dn(e)/ds ~ -fin(e). Accordingly, by (17), 


(19) 

( 20 ) 

At very low temperatures (T < 0.3 K), the roton contribution toward the inertia of the 
fluid is negligible in comparison with the phonon contribution. At relatively higher tem¬ 
peratures (T ~ 0.6 K), the two contributions become comparable. At temperatures above 
1K, the roton contribution is far more dominant than the phonon contribution; at such 
temperatures, the roton density alone accounts for the density p n of the normal fluid. 


AnB t . 
(Po)rot = .,,3 / n(e)p dp 


N 


Pi n 


= ~ (P )— — —r - 

3 7 V 3 kT V 

_ Anp 4 0 t2np \ 1/2 \ /kT _ 

3 h 3 \ kT ) 


25 The drift velocity v must satisfy the condition (v ■ p) < s, for otherwise some of the occupation numbers will become 
negative! This leads to the existence of a critical velocity v c for these excitations, such that for v exceeding v c the formalism 
developed here would break down. It is not difficult to see that this (critical) velocity is given by the relation v c = (e/p)min, 
as in equation (24). 
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It would be instructive to determine the critical temperature T c at which the theoreti¬ 
cal value of the density p n became equal to the actual density p\\ e of the liquid; this would 
mean the disappearance of the superfluid component of the liquid (and hence a transition 
from liquid He II to liquid He I). In this manner, we find that T c — 2.5K, as opposed to the 
experimental value of T k , which is ~ 2.19 K. The comparison is not too bad, considering 
the fact that in the present calculation we have assumed the roton gas to be a noninteract¬ 
ing system right up to the transition point; in fact, due to the presence of an exceedingly 
large number of excitations at higher temperatures, this assumption would not remain 
plausible. 

Equation (19) suggests that a roton excitation possesses an effective mass p^/3kT. 
Numerically, this is about 10 to 15 times the mass of a helium atom (and, hence, orders of 
magnitude larger than the parameter p of the roton spectrum). However, the more impor¬ 
tant aspect of the roton effective mass is that it is inversely proportional to the temperature 
of the roton gas! Historically, this aspect was first discovered empirically by Landau (1947) 
on the basis of the experimental data on the velocity of second sound in liquid He II and its 
specific heat. Now, since the effective mass of an excitation is generally determined by the 
quantity {p 2 ) /3 kT, Landau concluded that the quantity Ip 2 ) of the relevant excitations in 
this liquid must be temperature-independent. Thus, as the temperature of the liquid rises, 
the mean value of p 2 of the excitations must stay constant; this value may be denoted by 
Pq. The mean value of e, on the other hand, must rise with temperature. The only way to 
reconcile the two was to invoke a nonmonotonic spectrum with a minimum at p = p 0 . 

Linally, we would like to touch on the question of the critical velocity of superflow. 
For this, we consider a mass M of excitation-free superfluid in mass motion; its kinetic 
energy E and momentum P are given by \Mv 2 and Mv, respectively. Any changes in these 
quantities are related as follows: 


8E=(v-8P). (21) 

Supposing that these changes came about as a result of the creation of an excitation s(p) 
in the fluid, we must have, by the principles of conservation, 

8E = — s and SP = — p. (22) 

Equations (21) and (22) lead to the result 


e = (v■ p) < vp. (23) 

Thus, it is impossible to create an excitation s(p) in the fluid unless the drift velocity v of 
the fluid is greater than, or at least equal to, the quantity (s/p). Accordingly, if v is less than 
even the lowest value of s/p, no excitation at all can be created in the fluid, which will there¬ 
fore maintain its superfluid character. We thus obtain a condition for the maintenance of 
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superfluidity, namely 


V<V C = (s/p) min. (24) 

which is known as the Landau criterion for superflow. The velocity v c is called the criti¬ 
cal velocity of superflow; it marks an “upper limit" to the flow velocities at which the fluid 
exhibits superfluid behavior. The observed magnitude of the critical velocity varies signifi¬ 
cantly with the geometry of the channel employed; as a rule, the narrower the channel the 
larger the critical velocity. The observed values of v c range from about 0.1 cm/s to about 
70cm/s. 

The theoretical estimates of v c are clearly of interest. On one hand, we find that if the 
excitations obey the ideal-gas relationship, namely e = p 2 /2m, then the critical velocity 
turns out to be exactly zero. Any velocity v is then greater than the critical velocity; accord¬ 
ingly, no superflow is possible at all. This is a very significant result, for it brings out very 
clearly the fact that interatomic interactions in the liquid, which give rise to an excita¬ 
tion spectrum different from the one characteristic of the ideal gas, play a fundamental 
role in bringing about the phenomenon of superfluidity. Thus, while an ideal Bose gas 
does undergo the phenomenon of Bose-Einstein condensation, it cannot support the phe¬ 
nomenon of superfluidity as such! On the other hand, we find that (i) for phonons, v c — 
c~ 2.4 x 10 4 cm/s and (ii) forrotons, v c = {(p^ + 2p A) 1/2 - po}/p — A/po — 6.3 x 10 3 cm/s, 
which are too high in comparison with the observed values of v c . In fact, there is another 
type of collective excitations that can appear in liquid helium II, namely quantized vortex 
rings, with energy-momentum relationship of the form: e cx p ] /2 . The critical velocity for 
the creation of these rings turns out to be numerically consistent with the experimental 
findings; not only that, the dependence of v c on the geometry of the channel can also be 
understood in terms of the size of the rings created. 

For a review of this topic, especially in regard to Feynman’s contributions, see Mehra 
and Pathria (1994); see also Sections 11.4 through 11.6 of this text. 

Quantized dissipationless bosonic flow has also been observed in the solid phase of 
helium-4. This “supersolid” behavior was observed by Kim and Chan (2004a, 2004b) using 
a torsional oscillator containing solid helium infused silica with atomic-sized pores. At P — 
60 atm, the torsional frequency increases abruptly for temperatures below 175 mK. These 
authors interpret this result as helium atoms in the solid phase in the pores being free to 
flow without dissipation. 


Problems 

7 . 1 . By considering the order of magnitude of the occupation numbers («,), show that it makes no 
difference to the final results of Section 7.1 if we combine a finite number of (e ^ 0)-terms of the 
sum (7.1.2) with the (e = 0)-part of equation (7.1.6) or include them in the integral over e. 

7 . 2 . Deduce the virial expansion (7.1.13) from equations (7.1.7) and (7.1.8), and verify the quoted values 
of the virial coefficients. 
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7.3. Combining equations (7.1.24) and (7.1.26), and making use of the first two terms of formula (D.9) 
in Appendix D, show that, as T approaches T c from above, the parameter a(= —Inz ) of the ideal 
Bose gas assumes the form 


a 


1 / (3/2) 

tt \ 4 



7.4. Show that for an ideal Bose gas 


l/3z\ 5 g 5 / 2 (z) . 

z\dTJ p 2T g 3/2 (z)’ 


compare this result with equation (7.1.36). Hence show that 


Cp _ (< dz/dT)p _ 5 g 5 / 2 (z)g\/ 2 (z) 
C v (dz/BT) v 3 {g 3 / 2 (z)l 2 


as in equation (7.1.48b). Check that, as T approaches T c from above, both y and Cp diverge as 
C T-T c )-K 

7.5. (a) Show that the isothermal compressibility kj and the adiabatic compressibility ks of an ideal 
Bose gas are given by 

1 gi /2 (z) 3 g 3 / 2 (z) 

KT ~ nJcT g 3 / 2 (z)' KS ~ 5nkT g 5/2 (z) ’ 


where «(= N/V) is the particle density in the gas. Note that, as z ->■ 0, kj and ks approach their 
respective classical values, namely 1/P and 1/yP. How do they behave as z -»■ 1? 

(b) Making use of the thermodynamic relations 


C P -Cv = T 



/dV 

\dT 



2 

V 


and 


Cp/Cy = kj/ks, 

derive equations (7.1.48a) and (7.1.48b). 

7.6. Show that for an ideal Bose gas the temperature derivative of the specific heat Cy is given by 


1 / dCy \ 
Nk \ BT ) v 


1 
T 
45 v 
¥ TA3 


45 g 5/2 jz) 

8 g3/ 2 (Z) 

■' f(i) 


9 g3/ 2 (Z) 

4g!/ 2 (z) 


27 !g3/2(Z)} 2 g-l/ 2 (.Z) 
8 lgl/2(Z)} 3 


for T > T c , 


for T < T c . 


Using these results and the main term of formula (D.9), verify equation (7.1.38). 

7.7. Evaluate the quantities (B 2 P/3T 2 ) V , (3 2 /x/3T 2 )„, and ( B 2 pl/BT 2 )p for an ideal Bose gas and check 
that your results satisfy the thermodynamic relationships 


3 2 P 


C v =VT\ — \ -NT\ —£ 


BT 2 


3 2 /r 


BT 2 


and 


C P = -NT 



Examine the behavior of these quantities as T -*■ T c from above and from below. 
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7.8. The velocity of sound in a fluid is given by the formula 

w = y/(dP/dp) s , 


where p is the mass density of the fluid. Show that for an ideal Bose gas 


W 2 — g5/2(Z) 

~ 3m g 3/2 (z) 


5 

9 


(u 2 ), 


where (u 2 ) is the mean square speed of the particles in the gas. 
7.9. Show that for an ideal Bose gas 


(u) 


4 gi(z)g 2 (z ) 

* ig3/2(Z)} 2 ’ 


u being the speed of a particle. Examine and interpret the limiting cases z -> 0 and z -*■ 1; 
compare with Problem 6.6. 

7.10. Consider an ideal Bose gas in a uniform gravitational field of acceleration g. Show that the 
phenomenon of Bose-Einstein condensation in this gas sets in at a temperature T c given by 


r, ~ r! 


8 1 /ntngL\ l/2 

9f(|)V kT° ) 


where L is the height of the container and mgL <K kT°. Also show that the condensation here is 
accompanied by a discontinuity in the specific heat Cy of the gas: 


(ACv)t=t c — - 


8jt 



it mgL\ l/2 

ia<r) : 


see Eisenschitz (1958). 

7.11. Consider an ideal Bose gas consisting of molecules with internal degrees of freedom. Assuming 
that, besides the ground state so = 0, only the first excited state s i of the internal spectrum needs 
to be taken into account, determine the condensation temperature of the gas as a function of si. 
Show that, for (si/kT°) » 1, 


while, for (si/fcr°) <K 1, 


Te 

r r0 


1 



e -£i/fcr“ 



2 4 / 3 /wsi \ 1/2 

3 ? (i)WoJ 


[Hint: To obtain the last result, use the first two terms of formula (D.9) in Appendix D.] 

7.12. Consider an ideal Bose gas in the grand canonical ensemble and study fluctuations in the total 
number of particles N and the total energy E. Discuss, in particular, the situation when the gas 
becomes highly degenerate. 

7.13. Consider an ideal Bose gas confined to a region of area A in two dimensions. Express the number of 
particles in the excited states, N e , and the number of particles in the ground state, N 0 , in terms of 

z, T, and A, and show that the system does not exhibit Bose-Einstein condensation unless T 0 K. 

Refine your argument to show that, if the area A and the total number of particles N are held 
fixed and we require both N e and No to be of order N, then we do achieve condensation when 


h 2 1 


T 


mkl 2 InAl 
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where l [~ ,/( A/N )] is the mean interparticle distance in the system. Of course, if both A and 
N -> oo, keeping l fixed, then the desired T does go to zero. 

7.14. Consider an n-dimensional Bose gas whose single-particle energy spectrum is given by e a p s , 

where s is some positive number. Discuss the onset of Bose-Einstein condensation in this system, 
especially its dependence on the numbers n and .s'. Study the thermodynamic behavior of this 
system and show that, 



7.15. At time t = 0, the ground state wavefunction of a one-dimensional quantum harmonic oscillator 
with potential V(x) = i mw^x 2 is given by 



where a = J At t = 0, the harmonic potential is abruptly removed. Use the momentum 
representation of the wavefunction at t = 0 and the time-dependent Schrodinger equation to 
determine the spatial wavefunction and density at time t > 0; compare to equation (7.2.11). 

7.16. At time t = 0, a collection of classical particles is in equilibrium at temperature T in a three- 
dimensional harmonic oscillator potential V(r) = \mci>Q \r\ 2 . At t = 0, the harmonic potential is 
abruptly removed. Use the momentum distribution at f = 0 to determine the spatial density at 
time t > 0. Show that this is equivalent to the high temperature limit of equation (7.2.15). 

7.17. As shown in Section 7.1, nX 3 is a measure of the quantum nature of the system. Use equations 
(7.2.11) and (7.2.15) to determine nX 3 at the center of the harmonic trap at T = T c /2 for the 
condensed and noncondensed fractions. 

7.18. Show that the integral of the semiclassical spatial density in equation (7.2.15) gives the correct 
counting of the atoms that are not condensed into the ground state. 

7.19. Construct a theory for N bosons in an isotropic two-dimensional trap. This corresponds to a trap in 
which the energy level spacing due to excitations in the z direction is much larger than the spacing 
in the other directions. Determine the density of states a(e) of this system. Can a Bose-Einstein 
condensate form in this trap? If so, find the critical temperature as a function of the trapping 
frequencies and N. How much larger must the frequency in the third direction be for the system 

to display two-dimensional behavior? 

7.20. The (canonical) partition function of the blackbody radiation may be written as 


Q( V, T) = H Qi (ox T) 


CO 


so that 


OO 



here, Qi (m, T) is the single-oscillator partition function given by equation (3.8.14) and g(a>) is the 
density of states given by equation (7.3.2). Using this information, evaluate the Helmholtz free 
energy of the system and derive other thermodynamic properties such as the pressure P and the 
(thermal) energy density U/V. Compare your results with the ones derived in Section 7.3 from the 
(/-potential of the system. 

7.21. Show that the mean energy per photon in a blackbody radiation cavity is very nearly 2.7 kT. 

7.22. Considering the volume dependence of the frequencies u> of the vibrational modes of the radiation 
field, establish relation (7.3.17) between the pressure P and the energy density U/V. 

7.23. The sun may be regarded as a black body at a temperature of 5800 K. Its diameter is about 
1.4 x 10 9 m while its distance from the earth is about 1.5 x 10 n m. 
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(a) Calculate the total radiant intensity (in W/m 2 ) of sunlight at the surface of the earth. 

(b) What pressure would it exert on a perfectly absorbing surface placed normal to the rays of 
the sun? 

(c) If a flat surface on a satellite, which faces the sun, were an ideal absorber and emitter, what 
equilibrium temperature would it ultimately attain? 

7.24. Calculate the photon number density, entropy density, and energy density of the 2.725 K cosmic 
microwave background. 

7.25. Figure 7.20 is a plot of Cv(T) against T for a solid, the limiting value Cv(oo) being the classical 
result 3Nk. Show that the shaded area in the figure, namely 

OO 

J {C v (oo)-C v (T)}dT, 

o 

is exactly equal to the zero-point energy of the solid. Interpret the result physically. 



FIGURE 7.20 


7.26. Show that the zero-point energy of a Debye solid composed of N atoms is equal to ^Nk&o- 

[Note that this implies, for each vibrational mode of the solid, a mean zero-point energy ^k&o, 
that is, m= \o>d-] 

7.27. Show that, for T <g; &d, the quantity (Cp — Cv) of a Debye solid varies as T 7 and hence the ratio 
(Cp/Cv) — 1. 

7.28. Determine the temperature T, in terms of the Debye temperature @d, at which one-half of the 
oscillators in a Debye solid are expected to be in the excited states. 

7.29. Determine the value of the parameter @d for liquid He 4 from the empirical result (7.4.28). 

7.30. (a) Compare the “mean thermal wavelength” at of neutrons at a typical room temperature with 

the “minimum wavelength” k m i n of phonons in a typical crystal. 

(b) Show that the frequency wd for a sodium chloride crystal is of the same order of magnitude as 
the frequency of an electromagnetic wave in the infrared. 

7.31. Proceeding under conditions (7.4.16) rather than (7.4.13), show that 

C V (T) = Nk{D(x 0 , L ) + 2D(x 0 , t )}, 

where x 0i p = (hmo^/kT) and x 0 j = (^d.t/ kT). Compare this result with equation (7.4.17), and 
estimate the nature and the magnitude of the error involved in the latter. 

7.32. A mechanical system consisting of n identical masses (each of mass rn) connected in a straight line 
by identical springs (of stiffness K ) has natural vibrational frequencies given by 



1,2,...(«—l). 
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Correspondingly, a linear molecule composed of n identical atoms may be regarded as having a 
vibrational spectrum given by 


/ r 7z\ 

v r = v c sin( - • -);r= l,2,...(n- 1), 
v n 2 / 


where v c is a characteristic vibrational frequency of the molecule. Show that this model leads to a 
vibrational specific heat per molecule that varies as T l at low temperatures and tends to the 
limiting value (n — l)fc at high temperatures. 

7.33. Assuming the dispersion relation a> = Akf, where w is the angular frequency and k the wave number 
of a vibrational mode existing in a solid, show that the respective contribution toward the specific 
heat of the solid at low temperatures is proportional to T 3 / s . 

[Note that while s = 1 corresponds to the case of elastic waves in a lattice, s = 2 applies to spin 
waves propagating in a ferromagnetic system.] 

7.34. Assuming the excitations to be phonons (w = Ak), show that their contribution toward the specific 
heat of an n-dimensional Debye system is proportional to T". 

[Note that the elements selenium and tellurium form crystals in which atomic chains are arranged 
in parallel so that in a certain sense they behave as one-dimensional; accordingly, over a certain 
range of temperatures, the T l -law holds. For a similar reason, graphite obeys a T 2 -law over a 
certain range of temperatures.] 

7.35. The (minimum) potential energy of a solid, when all its atoms are “at rest” at their equilibrium 
positions, may be denoted by the symbol O 0 (FT, where V is the volume of the solid. Similarly, the 
normal frequencies of vibration, a>j(i = 1,2,..., 3 N — 6), maybe denoted by the symbols a>i(V). 
Show that the pressure of this solid is given by 


where U' is the internal energy of the solid arising from the vibrations of the atoms, while y is the 
Gruneisen constant : 


Assuming that, for V ~ V 0 , 


4>o(V) = 


(V-V 0 ) 2 


where kq and Vo are constants and kqCvT <jc Vo. show that the coefficient of thermal expansion (at 
constant pressure P ~ 0) is given by 

- 1 ( _ y K oCv 
a= V\~df) N n~ V 0 ' 


Also show that 


Cp — Cv = 


Y 2 KoCyT 


7.36. Apply the general formula (6.4.3) for the kinetic pressure of a gas, namely 

P= ^n(pu), 

to a gas of rotons and verify that the result so obtained agrees with the Boltzmannian relationship 
P = nkT. 

7.37. Show that the free energy A and the inertial density p of a roton gas in mass motion are given by 


A(v)=A(0)- 


x 
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and 


3(xcoshx — sinhx) 

p(v) = p(0)-=-- 


where x = vpo/kT. 

7.38. Integrating (7.6.17) by parts, show that the effective mass of an excitation, whose energy- 
momentum relationship is denoted by e(p), is given by 


"Jeff = 



d 

dp 



Check the validity of this result by considering the examples of (i) an ideal-gas particle, (ii) a 
phonon, and (iii) a roton. 






Ideal Fermi Systems 

8.1 Thermodynamic behavior of an ideal Fermi gas 

According to Sections 6.1 and 6.2, we obtain for an ideal Fermi gas 

PV 

— =ln© = ^]ln(l + ze _/i£ ) (1) 


and 


< 2 > 

e s 

where /? = 1/kT and z = exp (p/kT). Unlike the Bose case, the parameter z in the Fermi 
case can take on unrestricted values: 0 < z < oo. Moreover, in view of the Pauli exclusion 
principle, the question of a large number of particles occupying a single energy state does 
not even arise in this case; hence, there is no phenomenon like Bose-Einstein condensa¬ 
tion here. Nevertheless, at sufficiently low temperatures, Fermi gas displays its own brand 
of quantal behavior, a detailed study of which is of great physical interest. 

If we replace summations over e by corresponding integrations, equations (1) and (2) 
in the case of a nonrelativistic gas become 

W = J /5/2(z) (3) 

and 

N s 

V = J/3/2 (Z), (4) 

where g is a weight factor arising from the “internal structure” of the particles (e.g., spin), 
X is the mean thermal wavelength of the particles 

X = h/(2nmkT) 1/2 , (5) 


while f v (z) are Fermi-Dirac functions defined by, see Appendix E, 

oo - „ 

1 f x v x dx z 2 z 3 

^ v(z) = T>) J z~ 1 e x +1 = + ¥ ' 

0 


( 6 ) 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00008-6 
© 2011 Elsevier Ltd. All rights reserved. 
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Eliminating z between equations (3) and (4), we obtain the equation of state of the Fermi 
gas. 

The internal energy U of the Fermi gas is given by 


U=- —In® 

, 9 / 


z.V 


= lcT l [ — In <3 

,/5/2(X>. 


z,V 


thus, quite generally, this system satisfies the relationship 


P=-(U/V). 


(7) 


( 8 ) 


The specific heat Cy of the gas can be obtained by differentiating (7) with respect to T, 
keeping N and V constant, and making use of the relationship 


1 /92\ _ 3 /3/2(z) 

z\dTJ v 2T f i/2 (z)’ 


which follows from equation (4) and the recurrence formula (E.6) in Appendix E. The final 
result is 


Cy _ 15/5/2 (z) 9/i/2( 2 ) 

Nk~ 4 / 3 /2(2) 4 / i /2 (2) * 

For the Helmholtz free energy of the gas, we get 


and for the entropy 


A = Nu — PV = NkT \ In 2 - { 5/2(Z) I, 
I /i/2 (2) ] 


s = 


U-A 

T 


= Nk 


5/5/2 (2) 
2/3/2 (2) 



( 10 ) 


( 11 ) 


( 12 ) 


In order to determine the various properties of the Fermi gas in terms of the particle den¬ 
sity n (= N/V) and the temperature T, we need to know the functional dependence of the 
parameter z on n and T ; this information is formally contained in the implicit relationship 
(4). For detailed studies, one is sometimes obliged to make use of numerical evaluation 
of the functions f v (z); for physical understanding, however, the various limiting forms of 
these functions serve the purpose well (see Appendix E). 

Now, if the density of the gas is very low and/or its temperature very high, then the 
situation might correspond to 


/3/2 (2) = 


g 


nh 3 

g(2nmkT) 3 / 2 ^ 


(13) 
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we then speak of the gas as being nondegenerate and, therefore, equivalent to a classical 
ideal gas discussed in Section 3.5. In view of expansion (6), this implies that 1 and 
hence f v (z) — z. Expressions for the various thermodynamic properties of the gas then 
become 


and 


P = NkT/V, U = 7;NkT, C v = |lVfc, 


A = NkT 




(14) 

(15) 


(16) 


If the parameter z is small in comparison with unity but not very small, then we should 
make a fuller use of series (6) in order to eliminate z between equations (3) and (4) . The 
procedure is just the same as in the corresponding Bose case, that is, we first invert the 
series appearing in (4) to obtain an expansion for z in powers of (n):‘/g) and then substi¬ 
tute this expansion into the series appearing in (3) . The equation of state then takes the 
form of the virial expansion 


PV 

NkT 


la ‘ 

l=i 



(17) 


where v = l/n, while the coefficients fl; are the same as quoted in (7.1.14) but alternate in 
sign compared to the Bose case. For the specific heat, in particular, we obtain 


Cv = 3 NkY,(-l) l - l5 -^a l 


l=i 




1 - 0.0884 



+ 0.0066 




(18) 


Thus, at finite temperatures, the specific heat of the gas is smaller than its limiting value 
| Nk. As will be seen in the sequel, the specific heat of the ideal Fermi gas decreases mono- 
tonically as the temperature of the gas falls; see Figure 8.2 later in the section and compare 
it with the corresponding Figure 7.4 for the ideal Bose gas. 

If the density n and the temperature T are such that the parameter (nk : '/g) is of order 
unity, the foregoing expansions cannot be of much use. In that case, one may have to make 
recourse to numerical calculation. However, if ( nX 3 /g ) » 1, the functions involved can be 
expressed as asymptotic expansions in powers of (ln.z) -1 ; we then speak of the gas as being 
degenerate. As (nk 3 /g) -» oo, our functions assume a closed form, with the result that the 
expressions for the various thermodynamic quantities pertaining to the system become 






234 Chapter 8 • Ideal Fermi Systems 


highly simplified; we then speak of the gas as being completely degenerate. For simplicity, 
we first discuss the main features of the system in a state of complete degeneracy 

In the limit T -*■ 0, which implies (m?/g) -> oo, the mean occupation numbers of the 
single-particle state s(p) become 

(n s ) 

where is the chemical potential of the system at T = 0. The function (n f: ) is thus a step 
function that stays constant at the (highest) value 1 right from s = 0 to s = po and then 
suddenly drops to the (lowest) value 0; see the dotted line in Figure 8.1. Thus, at T = 0, 
all single-particle states up to s = p 0 are “completely” filled, with one particle per state 
(in accordance with the Pauli principle), while all single-particle states with s > p 0 are 
empty. The limiting energy po is generally referred to as the Fermi energy of the system 
and is denoted by the symbol sp\ the corresponding value of the single-particle momen¬ 
tum is referred to as the Fermi momentum and is denoted by the symbol pp . The defining 
equation for these parameters is 

ep 

J a(e)de = N, (20) 

o 

where a(s) denotes the density of states of the system and is given by the general expression 


1 


e (s-p)/kT _|_ i o 


for s < p o 
for e > po, 


(19) 




We readily obtain 


which gives 


N = 


4 ngV 
3 h 3 




Pf = 



( 21 ) 


( 22 ) 


(23) 



FIGURE 8.1 Fermi distribution at low temperatures, with x= s/kT and J = p/kT. The rectangle denotes the limiting 
distribution as T 0; in that case, the Fermi function is unity for e < p o and zero for e > pq. 
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accordingly, in the nonrelativistic case, 

_ / 3N \ 2/3 h 2 _ / 6jr 2 ?z\ 2/3 h 2 
Sp \4:ngV) 2m y g J 2m' 

The ground-state, or zero-point, energy of the system is then given by 



= 2n SV 5 
5 m/z 3 F ’ 


which gives 


(24) 


(25) 


Ep 3 p 2 3 

N 10m 5 SF ' 

The ground-state pressure of the system is in turn given by 

Po = 2(Eq/V) = \ne F . 

3 5 

Substituting for sf, the foregoing expression takes the form 


Po = 



2/3 h 2 

—n 5 ' 3 <xn 5 ' 3 . 

5m 


(26) 


(27) 


(28) 


The zero-point motion seen here is clearly a quantum effect arising because of the Pauli 
principle, according to which, even at T = 0 K, the particles constituting the system cannot 
settle down into a single energy state (as we had in the Bose case) and are therefore spread 
over a requisite number of lowest available energy states. As a result, the Fermi system, 
even at absolute zero, is quite live! 

For a discussion of properties such as the specific heat and the entropy of the system, 
we must extend our study to finite temperatures. If we decide to restrict ourselves to low 
temperatures, then deviations from the ground-state results will not be too large; accord¬ 
ingly, an analysis based on the asymptotic expansions of the functions/,, (z) would be quite 
appropriate. However, before we do that it seems useful to carry out a physical assessment 
of the situation with the help of the expression 


1 

“ e (s-F)/kT + i' 


(29) 


The situation corresponding to T = 0 is summarized in equation (19) and is shown as a step 
function in Figure 8.1. Deviations from this, when T is finite (but still much smaller than 
the characteristic temperature iH)/k), will be significant only for those values of e for which 
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the magnitude of the quantity (e — p)/kT is of order unity (for otherwise the exponential 
term in (29) will not be much different from its ground-state value, namely, e ±0 °); see the 
solid curve in Figure 8.1. 

We, therefore, conclude that the thermal excitation of the particles occurs only in a nar¬ 
row energy range that is located around the energy values = po and has a width O(kT). The 
fraction of the particles that are thermally excited is, therefore, 0(kT /ef) — the bulk of the 
system remaining uninfluenced by the rise in temperature. 1 This is the most characteris¬ 
tic feature of a degenerate Fermi system and is essentially responsible for both qualitative 
and quantitative differences between the physical behavior of this system and that of a 
corresponding classical system. 

To conclude the argument, we observe that since the thermal energy per “excited” par¬ 
ticle is O(kT), the thermal energy of the whole system will be 0{Nk 2 T 2 /sp); accordingly, the 
specific heat of the system will be 0(Nk ■ IcT /ef). Thus, the low-temperature specific heat 
of a Fermi system differs from the classical value \Nk by a factor that not only reduces 
it considerably in magnitude but also makes it temperature-dependent (varying as T 1 ). It 
will be seen repeatedly that the first-power dependence of Cy on T is a typical feature of 
Fermi systems at low temperatures. 

For an analytical study of the Fermi gas at finite, but low, temperatures, we observe 
that the value of z, which was infinitely large at absolute zero, is now finite, though still 
large in comparison with unity. The functions f v (z) can, therefore, be expressed as asymp¬ 
totic expansions in powers of (lnz) -1 ; see Sommerfeld’s lemma (E.17) in Appendix E. For 
the values of v we are presently interested in, namely |,|, and we have to the first 
approximation 


and 


/5/2(z) = 15^72 (lnz)5/2 


5n 2 2 

1+—(lnz) -2 + - 

O 


h/iiz) = —(biz) 3 / 2 


1 + — (lnz) -2 -|— 

O 


hi 2(z) = -^jzhnzfi 12 


1-(lnz) -2 + • 

24 


(30) 

(31) 


(32) 


Substituting (31) into (4), we obtain 


N 

V 


7(wT^ 2 


1 + — (lnz) 

O 


-2 


(33) 


'We, therefore, speak of the totality of the energy levels filled at T = 0 as “the Fermi sea" and the small fraction of 
the particles that are excited near the top, when T > 0, as a “mist above the sea.” Physically speaking, the origin of this 
behavior again lies in the Pauli exclusion principle, according to which a fermion of energy e cannot absorb a quantum of 
thermal excitation e t if the energy levels + et is already filled. Since bt = O(kT), only those fermions that occupy energy 
levels near the top level ef, up to a depth O(kT), can be thermally excited to go over to the unfilled energy levels. 
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In the zeroth approximation, this gives 


kTlnz = /r. ~ 


3AT \ 2/3 h 2 

4ngV) 2m’ 


(34) 


which is identical to the ground-state result p o = ef) see equation (24). In the next approxi¬ 
mation, we obtain 


kTlnz = /x — ef 


i n 2 fkT^ 2 ' 
1_ 12 ^ 


Substituting (30) and (31) into (7), we obtain 


— = -(fcrinz) 
N 5 


1+—dnz)- 2 + - 


with the help of (35), this becomes 


u 

3 

5?r 2 

( kT\ 2 

— = 

P B F 

1 H——- 

— i +•■■ 

N 

5 

12 

{bp J 


(35) 


(36) 


(37) 


The pressure of the gas is then given by 


2 U 

2 

5ji 2 

CM 

E-h 

-= 

-nsp 

1H ——— 

— +■■• 

3 V 

5 

12 

{ep/ 


(38) 


As expected, the main terms of equations (37) and (38) are identical to the ground-state 
results (26) and (27). From the temperature-dependent part of (37), we obtain for the low- 
temperature specific heat of the gas 


C v _ n 2 lcT 
Nk ~ Y Y + 


(39) 


Thus, for T <K 7>, where Tp(— ep/k) is the Fermi temperature of the system, the specific 
heat varies as the first power of temperature; moreover, in magnitude, it is consider¬ 
ably smaller than the classical value | Nk. The overall variation of Cv with T is shown in 
Figure 8.2. 

The Helmholtz free energy of the system follows directly from equations (35) and (38): 






2 


(40) 
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FIGURE 8.2 The specific heat of an ideal Fermi gas; the dotted line depicts the linear behavior at low temperatures. 


which gives 


S _ n 2 kT 
Nk ~ YJp + '"' 

Thus, as T -»• 0, S -*■ 0 in accordance with the third law of thermodynamics. 


(41) 


8.2 Magnetic behavior of an ideal Fermi gas 

We now turn our attention to studying the equilibrium state of a gas of noninteracting 
fermions in the presence of an external magnetic held B. The main problem here is to 
determine the net magnetic moment M acquired by the gas (as a function of B and T) and 
then calculate the susceptibility / (T). The answer naturally depends on the intrinsic mag¬ 
netic moment fi* of the particles and the corresponding multiplicity factor (2/ + 1); see, 
for instance, the treatment given in Section 3.9. According to the Boltzmannian treatment, 
one obtains a (positive) susceptibility x(T) which, at high temperatures, obeys the Curie 
law: x a T 1 ; at low temperatures, one obtains a state of magnetic saturation. However, 
if we treat the problem on the basis of Fermi statistics we obtain significantly different 
results, especially at low temperatures. 

In particular, since the Fermi gas is pretty live even at absolute zero, no magnetic sat¬ 
uration ever results; we rather obtain a limiting susceptibility xo, which is independent of 
temperature but is dependent on the density of the gas. Studies along these lines were first 
made by Pauli, in 1927, when he suggested that the conduction electrons in alkali metals 
be regarded as a “highly degenerate Fermi gas”; these studies enabled him to explain the 
physics behind the feeble and temperature-independent character of the paramagnetism of 
metals. Accordingly, this phenomenon is referred to as Pauli paramagnetism — in contrast 
to the classical Langevin paramagnetism. 

In quantum statistics, we encounter yet another effect which is totally absent in clas¬ 
sical statistics. This is diamagnetic in character and arises from the quantization of the 
orbits of charged particles in the presence of an external magnetic field or, one may say, 
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from the quantization of the (kinetic) energy of charged particles associated with their 
motion perpendicular to the direction of the field. The existence of this effect was first 
established by Landau (1930); so, we refer to it as Landau diamagnetism. This leads to an 
additional susceptibility x(T), which, though negative in sign, is somewhat similar to the 
paramagnetic susceptibility, in that it obeys Curie’s law at high temperatures and tends 
to a temperature-independent but density-dependent limiting value as T -»• 0. In gen¬ 
eral, the magnetic behavior of a Fermi gas is determined jointly by the intrinsic magnetic 
moment of the particles and the quantization of their orbits. If the spin-orbit interaction 
is negligible, the resultant behavior is given by a simple addition of the two effects. 

8.2.A Pauli paramagnetism 

The energy of a particle, in the presence of an external magnetic held B, is given by 



( 1 ) 


where g* is the intrinsic magnetic moment of the particle and m its mass. For simplicity, 
we assume that the particle spin is \; the vector g* will then be either parallel to the vector 
B or antiparallel. We thus have two groups of particles in the gas: 

(i) those having g* parallel to B, with e = p 2 /2m — g*B, and 

(ii) those having g* antiparallel to B, with e — p 2 /2m + g*B. 

At absolute zero, ah energy levels up to the Fermi level ep will be filled, while ah levels 
beyond sf will be empty. Accordingly, the kinetic energy of the particles in the first group 
will range between 0 and (sf + g*B), while the kinetic energy of the particles in the second 
group will range between 0 and (sf — g*B). The respective numbers of occupied energy 
levels (and hence of particles) in the two groups will, therefore, be 


N+= _ 3 \2miep + / r*B ))3/2 


( 2 ) 


and 


N- = ^{2m Sp -g*B)f' 2 . 


(3) 


The net magnetic moment acquired by the gas is then given by 



(4) 


We thus obtain for the low-field susceptibility (per unit volume) of the gas 



( 5 ) 






240 Chapter 8 • Ideal Fermi Systems 


Making use of formula (8.1.24), withg = 2, the foregoing result maybe written as 

Xo = ^n^* 2 /s F . (6) 

For comparison, the corresponding high-temperature result is given by equation (3.9.26), 
with g = 2 and / = \: 


Xoo = np* 2 /kT. 


(7) 


We note that xo/Xoo — O0 cT/ef). 

To obtain an expression for / that holds for all T, we proceed as follows. Denoting the 
number of particles with momentum p and magnetic moment parallel (or antiparallel) to 
the field by the symbol n+ (or n~), the total energy of the gas can be written as 


*»=E 


— - II*b] nt + ( — —I- I n„ 
2m ^ p \2m ^ I p 


=J2^P+ n P^-^ B(N+ ~ Nn ’ 

p 


( 8 ) 


where N + and N denote the total number of particles in the two groups, respectively. The 
partition function of the system is then given by 

Q(N) = ^'exp(-/S£„), (9) 


where the primed summation is subject to the conditions 

n P’ n p= Oort, (10) 


and 


J2 n p+J2 n P =n++n~=n. at) 

p p 

To evaluate the sum in (9), we first fix an arbitrary value of the number N + (which auto¬ 
matically fixes the value of N~ as well) and sum over all n+ and n~ that conform to the 
fixed values of the numbers N + and N~ as well as to condition (10). Next, we sum over all 
possible values of N + , namely from N + — 0 to N + = N. We thus have 


N 

Q(N) = E 


jv+=o 


p ftp*B(2N+-N) 


E'"exp -eZ&P 


[ni 


( 12 ) 
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here, the summation J2" i s subject to the restriction n~ = N + , while J2'" is subject to 
the restriction J2 P n p=N- N + . 

Now, let Qo(Af) denote the partition function of an ideal Fermi gas of A f “spinless” 
particles of mass m; then, obviously, 


Qo(A/") = ^ exp | -Pj2^—n p ) = exp {- pA 0 (A/")}, (13) 

{n p } \ P Zm ) 

where Ao(Af) is the free energy of this fictitious system. Equation (12) can then be 
written as 


N 

Q(N) = J2 [e 2 i* M * Biv+ Qo (N + )Qo(N — N + )], (14) 

N+=0 


which gives 

1 1 N 

— lnQ(AT) = ~Pfi*B+ — In [exp{2Pp*BN+- pA 0 (N+) - pA 0 (N- N+)}]. (15) 

iV+=0 


As before, the logarithm of the sum J2n+ ma Y i )e replaced by the logarithm of the largest 
term in the sum; the error committed in doing so would be negligible in comparison with 
the term retained. Now, the value N+, of N + , which corresponds to the largest term in 
the sum, can be ascertained by setting the differential coefficient of the general term, with 
respect to N + , equal to zero; this gives 


2p*B — 


~dA 0 {N+y 


' dA 0 (N - N + )~ 

dN+ 

N+=N+ 

9N+ 


that is 


M 0 (iV+)-M 0 (N-iV+) = 2/r*5, (16) 

where /ro(AO is the chemical potential of the fictitious system of Af “spinless” fermions. 

The foregoing equation contains the general solution being sought. To obtain an 
explicit expression for /, we introduce a dimensionless parameter r, defined by 

M = p.*(N+-N z: ) = p.*(2N+-N) = p*Nr (0<r<l); (17) 

equation (16) then becomes 

M°(^iv)- M o (^Ar)= 2 M*B. (18) 

If the magnetic field B vanishes so does r, which corresponds to a completely random ori¬ 
entation of the elementary moments. For small B, r would also be small; so, we may carry 
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out a Taylor expansion of the left side of (18) about r — 0. Retaining only the first term of 
the expansion, we obtain 


2fi* B 

1 ~ a/i 0 (xN ) i 

dx \x=\/2 


The low-field susceptibility (per unit volume) of the system is then given by 


(19) 


M n*Nr 2n/x* 2 
x ~ VB~ VB ~ 3 h 0 {xN) I ’ 

dx \x=\/2 

which is the desired result valid for all T. 

For T -y 0, the chemical potential of the fictitious system can be obtained from 
equation (8.1.34), with g = 1: 


Mo(xjV) = 


3xiV\ 2/3 h 2 
4nV) 2m’ 


which gives 


dfiojxN) 

dX 


x=l/2 


2 4 / 3 / 31V \ 
\4jtV) 


2/3^ 

2m 


( 21 ) 


On the other hand, the Fermi energy of the actual system is given by the same equa¬ 
tion (8.1.34), with g= 2: 


EF 


_ / 3 N \ 2/3 h 2 


\8jtV J 2m' 

Making use of equations (21) and (22), we obtain from (20) 


2tt/x* 2 3 *2 , 

xo = —a — = /bf, 

q ef 2 


( 22 ) 


(23) 


in complete agreement with our earlier result (6). For finite but low temperatures, one has 
to use equation (8.1.35) instead of (8.1.34). The final result turns out to be 


x-xo 


12 \e f J 


(24) 


On the other hand, for T -y oo, the chemical potential of the fictitious system follows 
directly from equation (8.1.4), with g = 1 and/ 3 / 2 (z) ~ z, with the result 


Hq(xN ) = kTln{xNX 3 /V) 
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which gives 


dpp(xN) 

dX 


= 2kT. 

x=l/2 


(25) 


Equation (20) then gives 


Xoo = nfi* 2 /kT, 


(26) 


in complete agreement with our earlier result (7). For large but finite temperatures, one 
has to take/ 3 / 2 (z) ~ z — (z 2 /2 3 / 2 ). The final result then turns out to be 


X - Xoo l - 


nX* 

2^2 


(27) 


the correction term here is proportional to ( Tp/T ) 3 / 2 and tends to zero as T —> oo. 


8.2.B Landau diamagnetism 

We now study the magnetism arising from the quantization of the orbital motion of 
(charged) particles in the presence of an external magnetic field. In a uniform field of 
intensity B, directed along the z-axis, a charged particle would follow a helical path whose 
axis is parallel to the z-axis and whose projection on the (x,y)-plane is a circle. Motion 
along the z-direction has a constant linear velocity u z , while that in the (x,y )-plane has a 
constant angular velocity eB/mc, the latter arises from the Lorentz force, e(u x B)/c, expe¬ 
rienced by the particle. Quantum-mechanically, the energy associated with the circular 
motion is quantized in units of ehB/mc. The energy associated with the linear motion 
along the z-axis is also quantized but, in view of the smallness of the energy intervals, this 
may be taken as a continuous variable. We thus have for the total energy of the particle 2 


E 


me V 2/ 2m 


(/ = 0 , 1 , 2 ,...). 


(28) 


Now, these quantized energy levels are degenerate because they result from a “coalescing 
together” of an almost continuous set of zero-field levels. A little reflection shows that all 
those levels for which the value of the quantity (p 2 + p 2 )/2m lay between ehBj/mc and 
ehB(j+ 1 )/mc now “coalesce together” into a single level characterized by the quantum 
number j. The number of these levels is given by 


jp J dxdydp x dpy 


LxLy 

~w 


ehB . 

2m - 10 + 1 ) -]} 

me 


— L xLy —, 


eB 
' he’ 


(29) 


2 See, for instance, Goldman et al. (1960); Problem 6.3. 
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FIGURE 8.3 The single-particle energy levels, for a two-dimensional motion, in the absence of an external magnetic 
field (B = 0) and in the presence of an external magnetic field (B > 0). 


which is independent of j. The multiplicity factor (29) is a quantum-mechanical measure 
of the freedom available to the particle for the center of its orbit to be “located” anywhere 
in the total area L x L y of the physical space. Figure 8.3 depicts the manner in which the 
zero-field energy levels of the particle group themselves into a spectrum of oscillator-like 
levels on the application of the external magnetic field. 

The grand partition function of the gas is given by the standard formula 

ln0 = ^ln(l+ze"^), (30) 


where the summation has to be carried over all single-particle states in the system. Sub¬ 
stituting (28) for s, making use of the multiplicity factor (29) and replacing the summation 
over p z by an integration, we get 


°° r oo 

lnfl = [ ^ In| i + ze -P‘* B V+<VW/ mc -PP l z/* n 

^ i—0 ' ' 

At high temperatures, z <3C 1; so, the system is effectively Boltzmannian. 
tion function then reduces to 


} ■ (31) 

The grand parti- 


ln& = 


zVeB 

h 2 c 


LXJ 

/ 


e -ppi/2m dpz J2 
i =o 


e -pehB\j+(l/2)]/mc 


zVeB 

h 2 c 


2 jrm\ b 2 

~r) 


12 sinh 


/ fiehB\ 
\ 2 me ) 



(32) 


The equilibrium number of particles N and the magnetic moment M of the gas are then 
given by 
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N = I z—In® 


B,V,T 


and 


M= - 


3 H 


1 


3 B/ P\dB 


In 0 


(33) 


(34) 


z,v,r 


where H is the Hamiltonian of the system; compare with equation (3.9.4). We thus obtain 

1351 

and 

7.V [ 1 rrnshr) 

(36) 


zV 

M = x^yMeff 


1 xcoshx | 
sinhx sinh 2 x j 


where A.{= Ii/CZtt mkT) ]/l2 } is the mean thermal wavelength of the particles, while 


x = /3/i e ffB (/x eff = eh/Anmc ). (37) 

Clearly, if e and m are the electronic charge and the electronic mass, then /x e u is the familiar 
Bohr magneton ///j. Combining (35) and (36), we get 

M = -Nfi eS L(x), (38) 


where L(x) is the Langevin function: 


L(x) = cothx— -. (39) 

x 

This result is very similar to the one obtained in the Langevin theory of paramagnetism; 
see Section 3.9. The presence of the negative sign, however, means that the effect obtained 
in the present case is diamagnetic in nature. We also note that this effect is a direct con¬ 
sequence of quantization; it vanishes if we let h -» 0. This is in complete accord with the 
Bohr-van Leeuwen theorem, according to which the phenomenon of diamagnetism does 
not arise in classical physics; see Problem 3.43. 

If the field intensity B and the temperature T are such that /i e ffT> <C kT, then the 
foregoing results become 



(40) 


and 


M ~ -N/jL 2 efl B/3kT. 


(41) 
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Equation (40) is in agreement with the zero-field formula z — nX 3 , while (41) leads to the 
diamagnetic counterpart of the Curie law: 

M _ 9 

Xoo — yg — — n i^ e f[/3kT; (42) 

see equation (3.9.12). It should be noted here that the diamagnetic character of this 
phenomenon is independent of the sign of the electric charge on the particle. For an elec¬ 
tron gas, in particular, the net susceptibility at high temperatures is given by the sum of 
expression (7), with p* replaced by /rg, and expression (42): 


Xoo — 



(43) 


where p' B = eh/Anrn'c, m' being the effective mass of the electron in the given system. 

We now look at this problem at all temperatures, though we will continue to assume 
the magnetic field to be weak, so that /x e ff B <<c kT. In view of the latter, the summation in 
(31) may be handled with the help of the Euler summation formula, 


OO 

J2f(j + -/ f(*)dx+ >0) 

J=° o 


(44) 


with the result 


ln0~ 


VeB 

h 2 c 


OO OO 


.0 -oo 


/ dx / dp z In j 1 + ze fibresBx+fA/im) 

OO 

f _ d P . 

J z - 1 


_ 12 


z~ 1 eP(Pz/ 2m} +1 


(45) 


The first part here is independent of B, which can be seen by changing the variable from x 
to x! — Bx. The second part, with the substitution fip\/2m = y, becomes 


rf<2m,3,2 <„0V' 2 / 


6 h 3 


:-V + l' 


(46) 


The low-field susceptibility (per unit volume) of the gas is then given by 


_ m _ i /a 




VB pVB\dB 
(2tt m) 3 l 2 p 2 i{ 


In & 


z,V,T 




(47) 


3h 3 p l l 2 
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which is the desired result. Note that, as before, the effect is diamagnetic in character — 
irrespective of the sign of the charge on the particle. 

For z«l,/i/ 2 (z)~z~ n)?] we then recover our previous result (42). For z1 (which 
corresponds to T 7»,/i/2Cz) ~ (2/7r 1 / 2 )(lnz) 1/2 ; we then get 


Xo 


2^(2 mf l2 ii 2 e ^y 2 
3 P 



W/Tgff/6FI 


(48) 


here, use has also been made of the fact that (fi~ l in z) — sf . Note that, in magnitude, this 
result is precisely one-third of the corresponding paramagnetic result (6) , provided that we 
take the /i* of that expression to be equal to the // e ff of this one. 


8.3 The electron gas in metals 

One physical system where the application of Fermi-Dirac statistics helped remove a 
number of inconsistencies and discrepancies is that of conduction electrons in metals. 
Historically, the electron theory of metals was developed by Drude (1900) and Lorentz 
(1904-1905), who applied the statistical mechanics of Maxwell and Boltzmann to the 
electron gas and derived theoretical results for the various properties of metals. The 
Drude-Lorentz model did provide a reasonable theoretical basis for a partial understand¬ 
ing of the physical behavior of metals; however, it encountered a number of serious 
problems of a qualitative as well as quantitative nature. For instance, the observed spe¬ 
cific heat of metals appeared to be almost completely accountable by the lattice vibrations 
alone and practically no contribution seemed to be coming from the electron gas. The 
theory, however, demanded that, on the basis of the equipartition theorem, each electron 
in the gas should possess a mean thermal energy | kT and hence make a contribution 
of | k to the specific heat of the metal. Similarly, one expected the electron gas to exhibit 
the phenomenon of paramagnetism arising from the intrinsic magnetic moment fig of the 
electrons. According to the classical theory, the paramagnetic susceptibility would be given 
by (8.2.7), with /x* replaced by hb■ Instead, one found that the susceptibility of a normal 
nonferromagnetic metal was not only independent of temperature but had a magnitude 
which, at room temperatures, was hardly 1 percent of the expected value. 

The Drude-Lorentz theory was also applied to study transport properties of met¬ 
als, such as the thermal conductivity K and the electrical conductivity a. While the 
results for the individual conductivities were not very encouraging, their ratio did con¬ 
form to the empirical law of Wiedemann and Franz (1853), as formulated by Lorenz 
(1872), namely that the quantity K/oT was a (universal) constant. The theoretical value 
of this quantity, which is generally known as the Lorenz number, turned out to be 
3 (k/e) 2 — 2.48 x 10 _13 e.s.u./deg 2 ; the corresponding experimental values for most alkali 
and alkaline-earth metals were, however, found to be scattered around a mean value of 
2.72 x 10 -13 e.s.u./deg 2 . A still more uncomfortable feature of the classical theory was the 
uncertainty in assigning an appropriate value to the mean free path of the electrons in 
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a given metal and in ascribing to it an appropriate temperature dependence. For these 
reasons, the problem of the transport properties of metals also remained in a rather 
unsatisfactory state until the correct lead was provided by Sommerfeld (1928). 

The most significant change introduced by Sommerfeld was the replacement of 
Maxwell-Boltzmann statistics by Fermi-Dirac statistics for describing the electron gas in 
a metal. With this single stroke of genius, he was able to set most of the things right. To 
see how it worked, let us first estimate the Fermi energy sf of the electron gas in a typical 
metal, say sodium. Referring to equation (8.1.24), with g = 2, 


sp = 


3 N \ 2/3 h 2 
8tt V ) 2m 1 ' 


( 1 ) 


where m' is the effective mass of an electron in the gas. 3 The electron density N/V, in the 
case of a cubic lattice, may be written as 


N n e n„ 

77 = (2) 

V a 3 

where n e is the number of conduction electrons per atom, n a the number of atoms per 
unit cell and a the lattice constant (or the cell length). 4 For sodium, n e = 1, n a — 2, and a = 
4.29 A. Substituting these numbers into (2) and writing m! = 0.98m e , we obtain from (1) 

(sp )Na = 5.03 x 10" 12 erg = 3.14 eV. (3) 

Accordingly, for the Fermi temperature of the gas is 


(T F ) Na = (1.16 X 10 4 ) X ep( in eV) 

= 3.64 x 10 4 K, (4) 

which is considerably larger than the room temperature T(~ 3 x 10 2 K). The ratio T/Tp 
being of the order of 1 percent, the conduction electrons in sodium constitute a highly 
degenerate Fermi system. This statement, in fact, applies to all metals because their Fermi 
temperatures are generally of order 10 4 - 10 5 K. 

Now, the very fact that the electron gas in metals is a highly degenerate Fermi system 
is sufficient to explain away some of the basic difficulties of the Drude-Lorentz theory. For 
instance, the specific heat of this gas would no longer be given by the classical formula, 

3 To justify the assumption that the conduction electrons in a metal may be treated as “free” electrons, it is necessary 
to ascribe to them an effective mass m 1 y m. This is an indirect way of accounting for the fact that the electrons in a metal 
are not really free; the ratio m'/m accordingly depends on the structural details of the metal and, therefore, varies from 
metal to metal. In sodium, m'/m ~ 0.98. 

4 Another way of expressing the electron density is to write N/V =fp/M, where / is the valency of the metal, p its 

mass density, and M the mass of an atom (o/M, thus, being the number density of the atoms). 
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Cy = |Mc, but rather by equation (8.1.39), namely 

7T 2 

Cy =—Nk(kT/ep); (5) 

obviously, the new result is much smaller in value because, at ordinary temperatures, 
the ratio ( JcT/ef ) = ( T/Tp) — O(10 -2 ). It is then hardly surprising that, at ordinary tem¬ 
peratures, the specific heat of metals is almost completely determined by the vibrational 
modes of the lattice and very little contribution comes from the conduction electrons. Of 
course, as temperature decreases, the specific heat due to lattice vibrations also decreases 
and finally becomes considerably smaller than the classical value; see Section 7.4, espe¬ 
cially Figure 7.14. A stage comes when the two contributions, both nonclassical, become 
comparable in value. Ultimately, at very low temperatures, the specific heat due to lattice 
vibrations, being proportional to T 3 , becomes even smaller than the electronic specific 
heat, which is proportional to T 1 . In general, we may write, for the low-temperature 
specific heat of a metal, 


C v =yT + ST 3 , (6) 

where the coefficient y is given by equation (5) or, more generally, can be shown to be 
proportional to the density of states at the Fermi energy (see Problem 8.13), while the coef¬ 
ficient <5 is given by equation (7.4.23). An experimental determination of the specific heat 
of metals at low temperatures is, therefore, expected not only to verify the theoretical result 
based on quantum statistics but also to evaluate some of the parameters of the problem. 

Such determinations have been made, among others, by Corak et al. (1955) who worked 
with copper, silver and, gold in the temperature range 1 to 5 K. Their results for copper are 
shown in Figure 8.4. The very fact that the (Cy/T) versus T 2 plot is well approximated by a 
straight line vindicates the theoretical formula (6). Furthermore, the slope of this line gives 
the value of the coefficient 8, from which one can extract the Debye temperature (-)o of 



FIGURE 8.4 The observed specific heat of copper at low temperatures (after Corak et al., 1955). 
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the metal. One thus gets for copper: ®d = (343.8 ± 0.5) K, which compares favorably with 
Leighton’s theoretical estimate of 345 K (based on the elastic constants of the metal). The 
intercept on the (CV/T)-axis yields the value of the coefficient y, namely (0.688 ± 0.002) 
millijoule mole -1 deg 2 , which agrees favorably with Jones’ estimate of 0.69 millijoule 
mole -1 deg -2 (based on a density-of-states calculation). 

The general pattern of the magnetic behavior of the electron gas in nonferromagnetic 
metals can be understood likewise. In view of the highly degenerate nature of the gas, the 
magnetic susceptibility / is given by the Pauli result (8.2.6) plus the Landau result (8.2.48), 
and not by the classical result (8.2.7). In complete agreement with the observation, the 
new result is (i) independent of temperature and (ii) considerably smaller in magnitude 
than the classical one. 

As regards transport properties K and a, the new theory again led to the Wiedemann- 
Franz law ; the Lorenz number, however, became (;r 2 /3)(k/e) 2 , instead of the classical 
3 (k/e) 2 . The resulting theoretical value, namely 2.71 x 10 -13 e.s.u./deg 2 , turned out to 
be much closer to the experimental mean value quoted earlier. Of course, the situa¬ 
tion regarding individual conductivities and the mean free path of the electrons did not 
improve until Bloch (1928) developed a theory that took into account interactions among 
the electron gas and the ion system in the metal. The theory of metals has continued 
to become more and more sophisticated; the important point to note here is that this 
development has all along been governed by the new statistics! 

Before leaving this topic, we would like to give a brief account of the phenomena of 
thermionic and photoelectric emission of electrons from metals. In view of the fact that 
electronic emission does not take place spontaneously, we infer that the electrons inside 
a metal find themselves caught in some sort of a “potential well’’ created by the ions. The 
detailed features of the potential energy of an electron in this well must depend on the 
structure of the given metal. For a study of electronic emission, however, we need not 
worry about these details and may assume instead that the potential energy of an elec¬ 
tron stays constant (at a negative value, — W, say) throughout the interior of the metal and 
changes discontinuously to zero at the surface. Thus, while inside the metal, the electrons 
move about freely and independently of one another; however, as soon as any one of them 
approaches the surface of the metal and tries to escape, it encounters a potential barrier 
of height W. Accordingly, only those electrons whose kinetic energy (associated with the 
motion perpendicular to the surface) is greater than W can expect to escape through this 
barrier. At ordinary temperatures, especially in the absence of any external stimulus, such 
electrons are too few in any given metal, with the result that there is practically no spon¬ 
taneous emission from metals. At high temperatures, and more so if there is an external 
stimulus present, the population of such electrons in a given metal could become large 
enough to yield a sizeable emission. We then speak of phenomena such as thermionic effect 
and photoelectric effect. 

Strictly speaking, these phenomena are not equilibrium phenomena because electrons 
are flowing out steadily through the surface of the metal. However, if the number of elec¬ 
trons lost in a given interval of time is small in comparison with their total population in 
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the metal, then the magnitude of the emission current may be calculated on the assump¬ 
tion that the gas inside continues to be in a state of quasistatic thermal equilibrium. The 
mathematical procedure for this calculation is very much the same as the one followed in 
Section 6.4 (for determining the rate of effusion R of the particles of a gas through an open¬ 
ing in the walls of the container). There is one difference, however; whereas in the case of 
effusion any particle that reached the opening with u z > 0 could escape unquestioned, 
here we must have u z > <2W/m) ]/2 , so that the particle in question could successfully 
cross over the potential barrier at the surface. Moreover, even if this condition is satisfied, 
there is no guarantee that the particle will really escape because the possibility of an inward 
reflection cannot be ruled out. In the following discussion, we shall disregard this possi¬ 
bility; however, if one is looking for a numerical comparison of theory with experiment, 
the results derived here must be multiplied by a factor (I — r), where r is the reflection 
coefficient of the surface. 


8.3.A Thermionic emission (the Richardson effect) 

The number of electrons emitted per unit area of the metal surface per unit time is given 
by 


OO OO OO 


--Iff 

pz=(2mW) 1 / 2 Px——°°Py—— ( x> 


2 dp x dpydp z l 

h 3 e (s—/i)/kT + } 


Uz, 


(7) 


compare with the corresponding expression in Section 6.4. Integration over the variables 
p x and p Y may be carried out by changing over to polar coordinates (p',f), with the result 


OO 



p z =(2mW) 1 / 2 


oo 

p^ d f _ 2 *P d P' _ 

m J exp{[(p ,2 /2m) + (p\/2m) - p\/kT} + 1 

p r =0 


4jtkT 
h 3 


/ 


p z =(2mW) 1 / 2 


Pzdpz ln[l + exp {(/x - pl/2m)/kT}\ 


OO 

f ds z ln[l + e^~^ kT l 

s z =W 


( 8 ) 


It so happens that the exponential term inside the logarithm, at all temperatures of inter¬ 
est, is much smaller than unity; see Note 5. We may, therefore, write ln(l +x) ~ x, with the 
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result 


OO 

I dsze ^-s z ykT 

s z =W 

_ 4 Jtmk 2 T 2 (/M _ W)/kT 

- *3 ' l3J 

The thermionic current density is then given by 

J = eR= A7im f T 2 e ( i‘- w )/ kT . ( 10 ) 

h 1 

It is only now that the difference between the classical statistics and the Fermi statistics 
really shows up! In the case of classical statistics, the fugacity of the gas is given by (see 
equation (8.1.4), with/ 3 / 2 (z) — z) 



nh 3 

2(2nmkT) 3 / 2 ’ 


( 11 ) 


accordingly, 


Jdass = ne (2^h) ' T l/2e ~ HkT (4> = W). ( 12 ) 

In the case of Fermi statistics, the chemical potential of the (highly degenerate) electron 
gas is practically independent of temperature and is very nearly equal to the Fermi energy 
of the gas (p ~ po = ep); accordingly, 

Anmek 2 9 + ,i. T 

/f.d. = T 2 e-^ kT (<(> = W — sf). (13) 

The quantity </> is generally referred to as the work function of the metal. According to (12), 
f is exactly equal to the height of the surface barrier; according to (13), it is equal to the 
height of the barrier over and above the Fermi level (see Figure 8.5). 
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FIGURE 8.5 The work function 0 of a metal represents the height of the surface barrier over and above the 
Fermi level. 
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The theoretical results embodied in equations (12) and (13) differ in certain important 
respects. The most striking difference seems to be in regard to the temperature depen¬ 
dence of the thermionic current density /. However, the major dependence on T comes 
through the factor exp(—0 /IcT) — so much so that whether we plot ln( J/T ]/2 ) against (1/ T) 
or ln(//T 2 ) against (1 /T) we obtain, in each case, a fairly good straight-line fit. Thus, from 
the point of view of the temperature dependence of /, a choice between formulae (12) and 
(13) is rather hard to make. However, the slope of the experimental line should give us 
directly the value of W if formula (12) applies or of {W - ep) if formula (13) applies! 

Now, the value of W can be determined independently, for instance, by studying the 
refractive index of a given metal for de Broglie waves associated with an electron beam 
impinging on the metal. For a beam of electrons whose initial kinetic energy is E, we have 


h , h 


A-out — —- and — —■- 

V(2 mE) 111 J[2m{E+W)] 


(14) 


so that the refractive index of the metal is given by 



(15) 


By studying electron diffraction for different values of E, one can derive the relevant value 
of W. In this manner, Davisson and Germer (1927) derived the value of W for a number of 
metals. For instance, they obtained for tungsten: W — 13.5 eV. The experimental results on 
thermionic emission from tungsten are shown in Figure 8.6. The value of </> resulting from 
the slope of the experimental line was about 4.5 eV. The large difference between these 
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FIGURE 8.6 Thermionic current from tungsten as a function of the temperature of the metal. The continuous line 
corresponds to r = A while the broken line corresponds to r = 0 ,r being the reflection coefficient of the surface. 
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two values clearly shows that the classical formula (12) does not apply. That the quantum- 
statistical formula (13) applies is shown by the fact that the Fermi energy of tungsten is 
very nearly 9eV; so, the value 4.5 eV for the work function of tungsten is correctly given 
by the difference between the depth W of the potential well and the Fermi energy ep. To 
quote another example, the experimental value of the work function for nickel was found 
to be about 5.0 eV, while the theoretical estimate for its Fermi energy turns out to be about 
11.8eV. Accordingly, the depth of the potential well in the case of nickel should be about 
16.8 eV. The experimental value of W, obtained by studying electron diffraction in nickel, 
is indeed (17 ± 1) eV. 5 

The second point of difference between formulae (12) and (13) relates to the actual 
value of the current obtained. In this respect, the classical formula turns out to be a com¬ 
plete failure while the quantum-statistical formula fares reasonably well. The constant 
factor in the latter formula is 


4 7i mek 2 
h 3 


120.4 amp cm 2 deg 2 ; 


(16) 


of course, this has yet to be multiplied by the transmission coefficient (1 — r). The corre¬ 
sponding experimental number, for most metals with clean surfaces, turns out to be in the 
range 60 to 120 amp cm -2 deg -2 . 

Finally, we examine the influence of a moderately strong electric field on the thermionic 
emission from a metal — the so-called Schottky effect. Denoting the strength of the electric 
field by F and assuming the field to be uniform and directed perpendicular to the metal 
surface, the difference A between the potential energy of an electron at a distance x above 
the surface and of one inside the metal is given by 

e 2 

A(x) = W — eFx - (x > 0), (17) 

4x 


where the first term arises from the potential well of the metal, the second from the (attrac¬ 
tive) field present, and the third from the attraction between the departing electron and the 
“image” induced in the metal; see Figure 8.7. The largest value of the function A(x) occurs 
atx = (e/4F) 1/2 , so that 


A max = W-e 3/2 F 1/2 i (18) 

thus, the field has effectively reduced the height of the potential barrier by an amount 
g3/2jh/2. a corresponding reduction should take place in the work function as well. 

5 In light of the numbers quoted here, one can readily see that the quantity in equation (8), being at most 

equal to eO'o-wj/W s e -<t/kT^ j Sj at a || temperatures of interest, much smaller than unity. This means that we are oper¬ 
ating here in the (Maxwellian) tail of the Fermi-Dirac distribution and hence the approximation made in going from 
equation (8) to equation (9) was justified. 
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FIGURE 8.7 A schematic diagram to illustrate the Schottky effect. 

Accordingly, the thermionic current density in the presence of the field F would be 
higher than the one in the absence of the field: 

J F = Jo exp(e 3/2 F 1/2 /fcT). (19) 

A plot of ln(/f// 0 ) against (F 1/2 /T) should, therefore, be a straight line, with slope e Ar2 /k. 
Working along these lines, De Bruyne (1928) obtained for the electronic charge a value of 
4.84 x 10 -10 e.s.u., which is remarkably close to the actual value of e. 

The theory of the Schottky effect, as outlined here, holds for field strengths up to about 
10 6 volts/cm. For fields stronger than that, one obtains the so-called cold emission, which 
means that the electric field is now strong enough to make the potential barrier practically 
ineffective; for details, see Fowler and Nordheim (1928). 

8.3.B Photoelectric emission (the Hallwachs effect) 

The physical situation in the case of photoelectric emission is different from that in the 
case of thermionic emission, in that there exists now an external agency, the photon in the 
incoming beam of light, that helps an electron inside the metal in overcoming the potential 
barrier at the surface. The condition to be satisfied by the momentum component p z of an 
electron in order that it could escape from the metal now becomes 

(p 2 z /2m) + hv > W, (20) 6 

where v is the frequency of the incoming light (assumed monochromatic). Proceeding in 
the same manner as in the case of thermionic emission, we obtain, instead of (8), 

OO 

4:jr mkT f / \tb r r 

R=— p — / ds z fn[l + e^~ £z)/fcr ]. (21) 

e z =W—liv 

We cannot, in general, approximate this integral the way we did there; so the integrand 
here stays as it is. It is advisable, however, to change over to a new variable x, defined by 

x=(e z -W+hv)/tcT, (22) 

6 In writing this condition, we have tacitly assumed that the momentum components p x and p y of the electron remain 
unchanged on the absorption of a photon. 
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whereby equation (21) becomes 


R = 


4nm(kT) 2 
h 3 


OO 

J dx In 
o 


1 + exp 


h(v - i’o) 
kT 



(23) 


where 


hv o = W — fi—W — sp = 4>. (24) 

The quantity </> will be recognized as the (thermionic) work function of the metal; accord¬ 
ingly, the characteristic frequency vo(= <p/h) may be referred to as the threshold frequency 
for (photoelectric) emission from the metal concerned. 

The current density of photoelectric emission is thus given by 

/ - Ann ™ k T 2 j dx I n (1 + e s ~ x ), (25) 

o 


where 


S = h(v — vq )/kT. 


(26) 


Integrating by parts, we find that 


/ dxln(l + e s x ) = j - =f 2 (e s y, 
o o 

see equation (8.1.6). Accordingly, 


/ = 


47r mek 2 

F 


T 2 f 2 (e s ). 


(27) 


(28) 


For h(v - vo) > kT, e s » 1 and the function f 2 {e & ) 5 2 /2; 

in Appendix E. Equation (28) then becomes 


see Sommerfeld’s lemma (E.17) 

(29) 


which is completely independent of T ; thus, when the energy of the light quantum is much 
greater than the work function of the metal, the temperature of the electron gas becomes 
a “dead” parameter of the problem. At the other extreme, when v < vq and h\v — vol » kT, 
then e & <$; 1 and the function f 2 (e s ) e s . Equation (28) then becomes 

^nnwk 2 rj,2„{hv-d))/kT 

J ~h 2 


(30) 
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which is just the thermionic current density (13), enhanced by the photon factor 
exp (hv/kT); in other words, the situation now is very much the same as in the case of 
thermionic emission, except for a diminished work function </>'(= <j> — hv). At the threshold 
frequency (v = vo), 5 = 0 and the function fz(e^) =/ 2 (l) = 7 t 2 /12; see equation (E.16), with 
7=1. Equation (28) then gives 


Jo = 


n 3 mek 2 9 
- T z 


(31) 


Figure 8.8 shows a plot of the experimental results for photoelectric emission from 
palladium (</> = 4.97 eV). The agreement with theory is excellent. It will be noted that the 
plot includes some observations with v < v 0 . The fact that we obtain a finite photocur¬ 
rent even for frequencies less than the so-called threshold frequency is fully consistent 
with the model considered here. The reason for this lies in the fact that, at any finite tem¬ 
perature T, there exists in the system a reasonable fraction of electrons whose energy s 
exceeds the Fermi energy ep by amounts O(kT). Therefore, if the light quantum hv gets 
absorbed by one of these electrons, then condition (20) for photoemission can be satisfied 
even if hv < (W - sp) = hv 0 . Of course, the energy difference h(v 0 — v) must not be much 
more than a few times kT, for otherwise the availability of the right kind of electrons will be 
extremely low. We, therefore, do expect a finite photocurrent for radiation with frequencies 
less than the threshold frequency vo, provided that h(v o — v) = O(kT). 

The plot shown in Figure 8.8, namely ln(//T 2 ) versus 8, is generally known as the 
“Fowler plot.” Fitting the observed photoelectric data to this plot, one can obtain the 
characteristic frequency vo and hence the work function </> of the given metal. We have 
previously seen that the work function of a metal can be derived from thermionic data as 
well. It is gratifying to note that there is complete agreement between the two sets of results 
obtained for the work function of the various metals. 



FIGURE 8.8 Photoelectric current from palladium as a function of the quantity h(v — m )/kT. The plot includes 
data taken at several temperatures T for different frequencies v. 
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8.4 Ultracold atomic Fermi gases 

After the demonstration of Bose-Einstein condensation in ultracold atomic gases in 1995 
(Section 7.2), researchers began using laser cooling and magnetic traps to cool gases of 
fermions to create degenerate Fermi gases of atoms. DeMarco and lin (1999) created the 
first degenerate atomic Fermi gas by cooling a dilute vapor of 40 K in an atomic trap into 
the nanokelvin temperature range. The density of states in a harmonic trap is a quadratic 
function of the energy: 


a(e) = 


2(hco 0 f ’ 


( 1 ) 


where co 0 = is the geometric mean of the trap frequencies in the cartesian 

directions; see equation (7.2.3). The chemical potential and the number of fermions in 
the trap are related by 


N(n,T) 


1 f s 2 de 

2 (hcoo) 3 J e^-^ + l ’ 
o 


( 2 ) 


which gives for the Fermi energy 


Ef = (6iV) ; (3) 

a Fermi temperature Tp — sp/k — 870 nK for 10 6 atoms in a 100 Hz trap, and a ground-state 
energy Uq — | Nsp . The internal energy of the trapped gas can be obtained by time-of- 
flight measurements as described in Section 7.2 and can be directly compared with the 
theoretical result 


U 

Uo 


. OO 

/ T \ 4 r x 3 dx 

\Tp ) J e x e~^ + 1 

o 


where the temperature and the chemical potential are related by 


3 


a OO 

/ TV / x 2 dx 

\Tp ) J e x e~^ + 1 

o 


= 1 ; 


(4) 


(5) 


see Figures 8.9 and 8.10. At low enough temperatures, attractive interactions lead to BEC- 
BCS condensation, as discussed in Section 11.9. 
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FIGURE 8.9 Scaled internal energy (U/Uo) versus scaled temperature (T/Tp) for an ideal Fermi gas in a harmonic 
trap from equations (4) and (5). The dotted line is the corresponding classical result U(T) = 3 NkT. 



T/T f 


FIGURE 8.10 Experimental results for the mean energy per particle divided by the equipartition value of 3 kT versus 
scaled temperature ( T/Tp ) for ultracold 40 K atoms in a harmonic trap compared to the theoretical Fermi gas value 
from equations (4) and (5). This shows the development of the Fermi degeneracy of the gas at low temperatures; 
from Jin (2002). Figure courtesy of the IOP. Reprinted with permission; copyright ©2002, American Institute of 
Physics. 


8.5 Statistical equilibrium of white dwarf stars 

Historically, the first application of Fermi statistics appeared in the field of astrophysics 
(Fowler, 1926). It related to the study of thermodynamic equilibrium of white dwarf stars — 
the small-sized stars that are abnormally faint for their (white) color. The general pattern 
of color-brightness relationship among stars is such that, by and large, a star with red color 
is expected to be a “dull" star, while one with white color is expected to be a “brilliant" star. 
However, white dwarf stars constitute an exception to this rule. The reason for this lies in 
the fact that these stars are relatively old whose hydrogen content is more or less used up, 
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with the result that the thermonuclear reactions in them are now proceeding at a rather 
low pace, thus making these stars a lot less bright than one would expect on the basis of 
their color. The material content of white dwarf stars, at the present stage of their career, is 
mostly helium. And whatever little brightness they presently have derives mostly from the 
gravitational energy released as a result of a slow contraction of these stars — a mechanism 
first proposed by Kelvin, in 1861, as a “possible” source of energy for all stars! 

A typical, though somewhat idealized, model of a white dwarf star consists of a mass 
M(~ 10 33 g) of helium, packed into a ball of mass density p(~ 10 7 gcm -3 ), at a central 
temperature T(~ 10 7 K). Now, a temperature of the order of 10 7 K corresponds to a mean 
thermal energy per particle of the order of 10 3 eV, which is much greater than the energy 
required for ionizing a helium atom. Thus, practically the whole of the helium in the star 
exists in a state of complete ionization. The microscopic constituents of the star may, 
therefore, be taken as N electrons (each of mass m) and jN helium nuclei (each of mass 
~ Amp). The mass of the star is then given by 

M ~ N{m + 2 nip) ~ 2Nm p 

and, hence, the electron density by 

N ^ M/2m p p 
V ~ M/p 2m p ' 

A typical value of the electron density in white dwarf stars would, therefore, be O(10 30 ) 
electrons per cm 3 . We thus obtain for the Fermi momentum of the electron gas [see 
equation (8.1.23), withg = 2] 


( 1 ) 


( 2 ) 


Pf = 



1/3 

Ji = 0( 10 17 )gcmsec \ 


(3) 


which is rather comparable with the characteristic momentum me of an electron. The 
Fermi energy sf of the electron gas will, therefore, be comparable with the rest energy me 2 
of an electron, that is, sf = O(10 6 ) eV and hence the Fermi temperature Tp = O(10 10 ) K. In 
view of these estimates, we conclude that (i) the dynamics of the electrons in this problem 
is relativistic, and (ii) the electron gas, though at a temperature large in comparison with 
terrestrial standards, is, statistically speaking, in a state of ( almost ) complete degeneracy: 
( T/Tp ) = O(10 -3 ). The second point was fully appreciated, and duly taken into account, 
by Fowler himself; the first one was taken care of later, by Anderson (1928) and by Stoner 
(1929, 1930). The problem, in full generality, was attacked by Chandrasekhar (1931-1935) 
to whom the final picture of the theory of white dwarf stars is chiefly due; for details, see 
Chandrasekhar (1939), where a complete bibliography of the subject is given. 

Now, the helium nuclei do not contribute as significantly to the dynamics of the prob¬ 
lem as do the electrons; in the first approximation, therefore, we may neglect the presence 
of the nuclei in the system. For a similar reason, we may neglect the effect of the radia¬ 
tion as well. We may thus consider the electron gas alone. Further, for simplicity, we may 
assume that the electron gas is uniformly distributed over the body of the star; we are thus 
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ignoring the spatial variation of the various parameters of the problem — a variation that 
is physically essential for the very stability of the star! The contention here is that, in spite 
of neglecting the spatial variation of the parameters involved, we expect that the results 
obtained here will be correct, at least in a qualitative sense. 

We study the ground-state properties of a degenerate Fermi gas composed of N 
relativistic electrons (g = 2). First of all, we have 


PF 



(4) 


0 


which gives 



(5) 


The energy-momentum relation for a relativistic particle is 

s = me 2 !!! + (p/mc) 2 } 1 ^ 2 — 1], 


( 6 ) 


the speed of the particle being 


ds 


(p/m) 


(7) 


u = — 


dp {1 + (p/me) 2 } 1 / 2 


here, m denotes the rest mass of the electron. The pressure P 0 of the gas is then given by, 
see equation (6.4.3), 



( 8 ) 


0 


We now introduce a dimensionless variable 6, defined by 


p = mcsinhS 


(9) 


which makes 


u — ctanh0. 


( 10 ) 


Equations (4) and (8) then become 



( 11 ) 


and 



(12) 


0 
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where 


A(x) = x(x 2 + l)h 2 (2x 2 — 3) + 3sinh 1 x, 


with 


x = sinh@f = pp/mc = (3n/8n) 1 i 3 (h/mc). 


(13) 


(14) 


The function A(x) can be computed for any desired value of x. However, asymptotic results 
for x 1 and x » 1 are often useful; these are given by (see Kothari and Singh, 1942) 


A(r\ 8 r 5 _ 4 r 7 1 r 9 5_ r ll . 

1 1 \ -A/ J ^ A y A | ^ A 22 I 

= 2x 4 - 2x 2 + 3(ln2x - ^) + |x“ 2 + ■ ■ ■ 


for x <§; 1 
for x»l 


(15) 


We shall now consider, somewhat crudely, the equilibrium configuration of this model. 
In the absence of gravitation, it would be necessary to have “external walls” for keeping the 
electron gas at a given density n. The gas will exert a pressure Po(n) on the walls and any 
compression or expansion (of the gas) will involve an expenditure of work. Assuming the 
configuration to be spherical, an adiabatic change in V will cause a change in the energy 
of the gas, as given by 


dEo = —Po(n)dV = —Pq(R) • 4nR 2 dR. 


(16) 


In the presence of gravitation, no external walls are needed, but the change in the kinetic 
energy of the gas, as a result of a change in the size of the sphere, will still be given by for¬ 
mula (16); of course, the expression for Po> as a function of the “mean” density n, must 
now take into account the nonuniformity of the system — a fact being disregarded in the 
present simple-minded treatment. However, equation (16) alone no longer gives us the net 
change in the energy of the system; if that were the case, the system would expand indefi¬ 
nitely till both n and Po(ri) 0. Actually, we have now a change in the potential energy as 
well; this is given by 


dEg = 


(dEg\ 
\dR ) 


J GM 2 
dR = a — dR, 
R 2 


(17) 


where M is the total mass of the gas, G the constant of gravitation, while a is a number (of 
the order of unity) whose exact value depends on the nature of the spatial variation of n 
inside the sphere. If the system is in equilibrium, then the net change in its total energy 
(Eq + Eg), for an infinitesimal change in its size, should be zero; thus, for equilibrium, 

a GM 2 
4?r R 4 


Po(R) = 


(18) 
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For P 0 (R), we may substitute from equation (12), where the parameter x is now given by 

/3«X 1|/3 h / 9AT X 1 / 3 h/mc 
\8jry me \32n 2 ) R 


or, in view of (1), by 


/ 9 MX 

h/mc t 

( 9jtM\ 

! 1,/3 h/mc 

\64 n 2 m p J 

^ R ' 

l 8 m p ) 

' R 


Equation (18) then takes the form 


A 


9jtM 1 1/3 h/mc\ _ 3 ah 3 GM 2 

8 m p J R J An 2 m A c 5 R 4 

(h/mc\ 3 GM 2 /R 

= djrc, {-JT) 


(19) 


( 20 ) 


the function A(x) is given by equations (13) and (15). 

Equation (20) establishes a one-to-one correspondence between the masses M and 
the radii R of white dwarf stars; it is, therefore, known as the mass-radius relationship for 
these stars. It is rather interesting to see the combinations of parameters that appear in 
this relationship; we have here (i) the mass of the star in terms of the proton mass, (ii) the 
radius of the star in terms of the Compton wavelength of the electron, and (iii) the grav¬ 
itational energy of the star in terms of the rest energy of the electron. This relationship, 
therefore, exhibits a remarkable blending of quantum mechanics, special relativity, and 
gravitation. 

In view of the implicit character of relationship (20) , we cannot express the radius of the 
star as an explicit function of its mass, except in two extreme cases. For this, we note that, 
since M ~ 10 33 g, m p ~ 10 _24 g, and h/mc ~ 10 -11 cm, the argument of the function A{x) 
will be of the order of unity when R ~ 10 8 cm. We may, therefore, define the two extreme 
cases as follows: 

(i) R » 10 8 cm, which makes r«l and hence A(x) ^ fx 5 , with the result 


3(9tt) 2/3 h 2 M~V 3 
40 “ Gmm p /3 


( 21 ) 


(ii) R 4C10 8 cm, which makes x » 1 and hence A(x) ~ 2x 4 — 2x 2 , with the result 


(9tt ) 4 / 3 h / M \ 1/3 

\m p ) 


1 - 


/ m _\ 2/3 1 1/2 
VMoJ j 


R 


2 me 


( 22 ) 
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where 



9 / 3 tt \ 1/2 ( ftc / G) 3 / 2 


( 23 ) 


We thus find that the greater the mass of the white dwarf star, the smaller its size. Not 
only that, there exists a limiting mass M 0 , given by expression (23), that corresponds to a 
vanishing size of the star. Obviously, for M > M 0 , our mass-radius relationship does not 
possess any real solution. We, therefore, conclude that all white dwarf stars in equilibrium 
must have a mass less than M 0 — a conclusion fully upheld by observation. 

The correct limiting mass of a white dwarf star is generally referred to as the 
Chandrasekhar limit. The physical reason for the existence of this limit is that for a mass 
exceeding this limit the ground-state pressure of the electron gas (that arises from the fact 
that the electrons obey the Pauli exclusion principle) would not be sufficient to support 
the star against its “tendency toward a gravitational collapse.” The numerical value of the 
limiting mass, as given by expression (23), turns out to be ~ 10 33 g. Detailed investigations 
by Chandrasekhar led to the result: 



( 24 ) 


where © denotes the mass of the sun, which is about 2 x 10 33 g, while p e is a number 
that represents the degree of ionization of helium in the gas. By definition, p e = M/Nmn 
compare to equation (1). Thus, in most cases, p e ~ 2; accordingly M 0 ~ 1.440. 

Figure 8.11 shows a plot of the theoretical relationship between the masses and the radii 
of white dwarf stars. One can see that the behavior in the two extreme regions, namely for 
R^> l and R <<c /, is described quite well by formulae (21) and (22) of the treatment given 
here. The Chandrasekhar limit (24) is the mechanism responsible for stellar collapse into 
neutron stars and black holes. In particular, white dwarf stars whose mass exceeds the 
Chandrasekhar limit due to influx of matter from a companion binary star are thought to 
be the primary mechanism for type la supernovae; see Hillebrandt and Niemeyer (2000). 
Such events happen in a typical galaxy on the order of once per hundred years. For a few 
days after the collapse and subsequent explosion, these supernovae can be comparable in 
brightness to the remainder of the stars in the galaxy combined. Their well-calibrated light 
curves provide a bright “standard candle” for determining the distance to remote galaxies 
used to measure the expansion rate of the universe; see Chapter 9. 


8.6 Statistical model of the atom 


Another application of the Fermi statistics was made by Thomas (1927) and Fermi (1928) 
for calculating the charge distribution and the electric held in the extra-nuclear space of 
a heavy atom. Their approach was based on the observation that the electrons in this sys¬ 
tem could be regarded as a completely degenerate Fermi gas of nonuniform density n(r). 
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FIGURE 8.11 The mass-radius relationship for white dwarfs (after Chandrasekhar, 1939). The masses are 
expressed in terms of the limiting mass Mo and the radii in terms of a characteristic length l, which is given 
by 7.71/c- 1 x 10 8 cm ~ 3.86 x 10 8 cm. 


By considering the equilibrium state of the configuration, one arrives at a differential equa¬ 
tion whose solution gives directly the electric potential 0 (r) and the electron density n(r) 
at point r. By the very nature of the model, which is generally referred to as the Thomas- 
Fermi model of the atom, the resulting function n(r) is a smoothly varying function of r, 
devoid of the “peaks” that would remind one of the electron orbits of the Bohr theory. 
Nevertheless, the model has proved quite useful in deriving composite properties such 
as the binding energy of the atom. And, after suitable modifications, it has been success¬ 
fully applied to molecules, solids, and nuclei as well. Here, we propose to outline only the 
simplest treatment of the model, as applied to an atomic system; for further details and 
other applications, see Gombas (1949, 1952) and March (1957), where references to other 
contributions to the subject can also be found. 

According to the statistics of a completely degenerate Fermi gas, we have exactly two 
electrons (with opposite spins) in each elementary cell of the phase space, with p < pp\ the 
Fermi momentum pp of the electron gas is determined by the electron density n, according 
to the formula 


Pf = (3n 2 n) l ^h. 


( 1 ) 


In the system under study, the electron density varies from point to point; so would the 
value of pf. We must, therefore, speak of the limiting momentum pp as a function of r, 
which is clearly a “quasiclassical” description of the situation. Such a description is jus¬ 
tifiable if the de Broglie wavelength of the electrons in a given region of space is much 
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smaller than the distance over which the functions pF{r), n{r), and </>(#•) undergo a signifi¬ 
cant variation; later on, it will be seen that this requirement is satisfied reasonably well by 
the heavier atoms. 

Now, the total energy e of an electron at the top of the Fermi sea at the point r is given 

by 

sir) = —p 2 (r) - e<j>ir), ( 2 ) 

where e denotes the magnitude of the electronic charge. When the system is in a stationary 
state, the value of e(r) should be the same throughout, so that electrons anywhere in the 
system do not have an overall tendency to “flow away” toward other parts of the system. 
Now, at the boundary of the system, pp must be zero; by a suitable choice of the zero of 
energy, we can also have 0 = 0 there. Thus, the value of s at the boundary of the system is 
zero; so must, then, be the value of e throughout the system. We thus have, for all r, 

■J—Ppir) — e<f>(r) = 0. (3) 

2m 

Substituting from (1) and making use of the Poisson equation, 

V 2 0(r) = —4 np(r) — 4nen(r), (4) 


we obtain 


V 2 0(r) = 


4e(2 me ) 3 / 2 
3nh? 


{0(r)) 3/2 . 


Assuming spherical symmetry, equation (5) takes the form 


1 d 
r 2 dr 



4e(2 me ) 3 / 2 
3nfr? 


{ 0 (r)l 3/2 , 


(5) 


( 6 ) 


which is known as the Thomas-Fermi equation of the system. Introducing dimensionless 
variables x and <J>, defined by 


/ 4 \ 2 /3 , n me 2 Z 1 / 3 

r—9 _ _r—_ r 

\3n) h 2 0.88534a/ 


(7) 


and 


*(*) = 


<t>(r) 

Ze/r’ 


( 8 ) 


where Z is the atomic number of the system and as the first Bohr radius of the hydrogen 
atom, equation ( 6 ) reduces to 


d 2 <t> 4> 3 / 2 


dx 2 x 1 ! 2 ' 


(9) 
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Equation (9) is the dimensionless Thomas-Fermi equation of the system. The boundary 
conditions on the solution to this equation can be obtained as follows. As we approach the 
nucleus of the system (r —>■ 0), the potential </>(r) approaches the unscreened value Ze/r ; 
accordingly, we must have: 4>(x -»■ 0) = 1. On the other hand, as we approach the bound¬ 
ary of the system (r —>■ ro ), </>(r) in the case of a neutral atom must tend to zero; accordingly, 
we must have: (x -»• xo) = 0. In principle, these two conditions are sufficient to determine 
the function <t> (x) completely. However, it would be helpful if one knew the initial slope of 
the function as well, which in turn would depend on the precise location of the boundary. 
Choosing the boundary to be at infinity fr () = oo), the appropriate initial slope of the func¬ 
tion <t> (x) turns out to be very nearly —1.5886; in fact, the nature of the solution near the 
origin is 

$(x) = 1- 1.5886x+^x 3/2 + --- (10) 

For x > 10, the approximate solution has been determined by Sommerfeld (1932): 



where 


V(73) - 7 


0.257. 


( 12 ) 


As x-y oo, the solution tends to the simple form: <J>(x) ~ 144/x 3 . The complete solution, 
which is a monotonically decreasing function of x, has been tabulated by Bush and Cald¬ 
well (1931). As a check on the numerical results, we note that the solution must satisfy the 
integral condition 


f^ x vt dl . h (. 3 ) 

0 

which expresses the fact that the integral of the electron density n(r) over the whole of the 
space available to the system must be equal to Z, the total number of electrons present. 

From the function 4>(x), one readily obtains the electric potential 0 (r) and the electron 
density n(r): 


and 


4>(r) = 



rZ V3 \ 
0.88534fig J 


ocZ 4 / 3 


(2 me) 3 / 2 , 

n(r) = 37 r 2 fe 3 l < H ? ')i 3/2 ■ 


(14) 


(15) 


3n 2 h 3 
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FIGURE 8.12 The electron distribution function D(r) for an atom of mercury. The distance r is expressed in terms 
of the atomic unit of length a(= fi 2 /rae 2 ). 


A Thomas-Fermi plot of the electron distribution function D(r){= n(r) ■ Airr 2 } for an atom 
of mercury is shown in Figure 8.12; the actual “peaked” distribution, which conveys unmis¬ 
takably the preference of the electrons to be in the vicinity of their semiclassical orbits, is 
also shown in the figure. 

To calculate the hinding energy of the atom, we should determine the total energy of the 
electron cloud. Now, the mean kinetic energy of an electron at the point r would be | ep(r); 
by equation (3), this is equal to iecpir). The total kinetic energy of the electron cloud is, 
therefore, given by 


OO 


3 

-e 

5 


/ 


<j>(r)n(r) ■ 4nr 2 dr. 


o 


(16) 


For the potential energy of the cloud, we note that a part of the potential <f>(r) at the point 
r is due to the nucleus of the atom while the rest of it is due to the electron cloud itself; 
the former is clearly (Ze/r), so the latter must be (</>(r) — Ze/r}. The total potential energy 
of the cloud is, therefore, given by 


cxj 

- e ! 


Ze 1 


Ze 


— + ^U(r)- 


n(r) ■ 4nr z dr. 


(17) 


We thus obtain for the total energy of the cloud 


E 0 = 


1 1 Ze 2 I o , 

— ed>(r) -> n(r) ■ 4nr dr; 

10 2 r I 


(18) 
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of course, the electron density n(r), in terms of the potential function 0(r), is given by 
equation (15). 

Now, Milne (1927) has shown that the integrals 


OO 


0 


r z dr and 


OO 

J {0(r)} 3/2 rdr, 


(19) 


which appear in the expression for E 0 , can be expressed directly in terms of the initial slope 
of the function <J>(x), that is, in terms of the number —1.5886 of equation (10). After a little 
calculus, one finds that 


1.5886 

0.88534 



( 20 ) 


from which one obtains for the (Thomas-Fermi) binding energy of the atom: 

E b = —Eq = 1.538Z 7 / 3 x, (21) 

where x(= e 2 /2a[>, ~ 13.6 eV) is the (actual) binding energy of the hydrogen atom. 

It is clear that our statistical result (21) cannot give us anything more than just the first 
term of an “asymptotic expansion” of the binding energy Eb in powers of the parameter 
Z _1 /3 p or practical values of Z, other terms of the expansion are also important; however, 
they cannot be obtained from the simple-minded treatment given here. The interested 
reader may refer to the review article by March (1957). 

In the end we observe that, since the total energy of the electron cloud is proportional to 
Z 7/3 , the mean energy per electron would be proportional to Z 4 / 3 ; accordingly, the mean 
de Broglie wavelength of the electrons in the cloud would be proportional to Z _2</3 . At 
the same time, the overall linear dimensions of the cloud are proportional to Z -1 / 3 ; see 
equation (7). We thus find that the quasiclassical description adopted in the Thomas- 
Fermi model is more appropriate for heavier atoms (so that Z -2 / 3 <^Z -1 / 3 ). Otherwise, 
too, the statistical nature of the approach demands that the number of particles in the 
system be large. 


Problems 

8.1 Let the Fermi distribution at low temperatures be represented by a broken line, as shown in 
Figure 8.13, the line being tangential to the actual curve at e = /i. Show that this approximate 
representation yields a “correct” result for the low-temperature specific heat of the Fermi gas, 
except that the numerical factor turns out to be smaller by a factor of 4 In 2 . Discuss, in a qualitative 
manner, the origin of this numerical discrepancy. 

8.2 For a Fermi-Dirac gas, we may define a temperature To at which the chemical potential of the gas is 
zero (z= 1). Express To in terms of the Fermi temperature 7> of the gas. 

[Hint. Use equation (E.16).] 
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FIGURE 8.13 An approximate representation of the Fermi distribution at low temperatures: here, 

x = s/kT and f = p/kT. 


8.3 Show that for an ideal Fermi gas 


l/3z\ 5 / 5/z (z). 

z\dTj p 2Tf 3/2 (z)’ 


compare with equation (8.1.9). Hence show that 


_ Cp _ (dz/dT)p _ 5/5/2(Z)/i/2(3) 

K “ Cv ~ 0 z/dT) v ~~ 3 {/ 3 / 2 (z)} 2 

Check that at low temperatures 



8.4 (a) Show that the isothermal compressibility kj and the adiabatic compressibility ks of an ideal 
Fermi gas are given by 


K T = 


1 /l/ 2 (z) 


nlcT f 3 / 2 (z) ’ KS 5nkTf 5/2 (z)’ 
where n (= N/V) is the particle density in the gas. Check that at low temperatures 

-2 / urns 2 


3 /v 2 ( 2 ) 


Kp ~ 


2 HBF 
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(b) Making use of the thermodynamic relation 

Cp — Cv 

show that 


-C v = t(^] =TVk t (^ 


y\3Tj 

Cp — Cy _ 4 Cy fi/ 2 (z) 
Cy 9Nkf 3 / 2 (z) 

7i 2 (kT^ 2 

, „ 


, TV “ T U, v 


3 


(kT «; ef). 
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(c) Finally, making use of the thermodynamic relation y = kj/ks, verify the results of 
Problem 8.3. 

8.5 Evaluate (8 2 P/dT 2 ) v , (d 2 p/dT 2 ) V} and ( d 2 p/dT 2 )p of an ideal Fermi gas and check that your results 
satisfy the thermodynamic relations 


C v = VT 


d 2 P \ 

ar 2 ) 

/ V 


-NT 



V 


and 


C P = -NT 



Examine the low-temperature behavior of these quantities. 

8.6 Show that the velocity of sound w in an ideal Fermi gas is given by 


;2 _ 5A.T f 5 /z(z) 
~ 3 m f 3/2 (z) 



where ( u 2 ) is the mean square speed of the particles in the gas. Evaluate w in the limit z -y oo and 
compare it with the Fermi velocity uf- 

8.7 Show that for an ideal Fermi gas 


(u) 



x {/3/2(Z)} 2 ’ 


u being the speed of a particle. Further show that at low temperatures 


(u) 


/1\ 

9 

n 2 

(kT\ 2 ' 

- 

1+ — 

— 

\ u\ 

8 

12 

) 


compare with Problem 6.6. 

8.8 Obtain numerical estimates of the Fermi energy (in eV) and the Fermi temperature (in K) for the 
following systems: 

(a) conduction electrons in silver, lead, and aluminum; 

(b) nucleons in a heavy nucleus, such as soFIg 200 , and 

(c) Fie 3 atoms in liquid helium-3 (atomic volume: 63 A 3 per atom). 

8.9 Making use of another term of the Sommerfeld lemma (E.17), show that in the second 
approximation the chemical potential of a Fermi gas at low temperatures is given by 






2 


Jr 4 /W\ 4 ' 
80 ) 


and the mean energy per particle by 


U 

N 





it 4 / kT \ 4 

16 V^J 


(8.1.35a) 


(8.1.37a) 


Hence determine the T 3 -correction to the customary r 1 -result for the specific heat of an electron 
gas. Compare the magnitude of the I 3 -term, in a typical metal such as copper, with the low- 
temperature specific heat arising from the Debye modes of the lattice. For further terms of these 
expansions, see Kiess (1987). 
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8.10 Consider an ideal Fermi gas, with energy spectrum cap 1 , contained in a box of “volume” V in a 
space of n dimensions. Show that, for this system, 

(a) PV=-U; 
n 

... Cy _ n/n \ /(n/s) + i Cz) ( n \ 2 fn/s{z) _ 

Nk S Vs / fn/s(z) \s) fn/s)—l(z) ’ 

Cp-Cy _ ( sCy \ 2 f(n/s)~l(Z) _ 
c Nk \ nNk) f n /s)(z) 

(d) the equation of an adiabat is PV^Hs/n) = const., and 

(e) the index (1 + ( s/n)) in the foregoing equation agrees with the ratio (Cp/Cy) of the gas only 
when T 3 > Tp. On the other hand, when T <g; Tp, the ratio (Cp/Cy) ~ 1 + (n 2 /3)(kT/ep) 2 , 
irrespective of the values of .s' and n. 

8.11 Examine results (b) and (c) of the preceding problem in the high-temperature limit (T Tp) as well 
as in the low-temperature limit (T <g; 7», and compare the resulting expressions with the ones 
pertaining to a nonrelativistic gas and an extreme relativistic gas in three dimensions. 

8.12 Show that, in two dimensions, the specific heat Cy(N, T) of an ideal Fermi gas is identical to the 
specific heat of an ideal Bose gas, for all N and T. 

[Hint: It will suffice to show that, for given N and T, the thermal energies of the two systems differ 
at most by a constant. For this, first show that the fugacities, zp and z B , of the two systems are 
mutually related: 

(\ + zp)(l-z B ) = l, ie., z b = zf/(1 + zf). 

Next, show that the functions / 2 (zf) and £ 2 ( 25 ) are also related: 

fzizp) = j lna z +Z) dz 
0 

It is now straightforward to show that 

Ep(N, T) = E b (N, T) + const., 

the constant being Ef(N,0).] 

8.13 Show that, quite generally, the low-temperature behavior of the chemical potential, the specific 
heat, and the entropy of an ideal Fermi gas is given by 


pL — SF 


1 7t 2 ( 9lna(e)^ 

31ne ) e=ep 



and 

7n 

Ci/^S~ —k 2 Ta(e F ), 

where a(e) is the density of (the single-particle) states in the system. Examine these results for a gas 
with energy spectrum e oc p s , confined to a space of n dimensions, and discuss the special cases: 
s = 1 and 2, with n = 2 and 3. 

[Hint: Use equation (E.18) from Appendix E.] 

8.14 Investigate the Pauli paramagnetism of an ideal gas of fermions with intrinsic magnetic moment p * 
and spin Jh(J = |, and derive expressions for the low-temperature and high-temperature 
susceptibilities of the gas. 
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8.15 Show that expression (8.2.20) for the paramagnetic susceptibility of an ideal Fermi gas can be 
written in the form 


nfi* 2 fi /2 (z) 
kT f 3 / 2 (z)' 


Using this result, verify equations (8.2.24) and (8.2.27). 

8.16 The observed value of y, see equation (8.3.6), for sodium is 4.3 x 10 4 cal mole ' K~ 2 . Evaluate 
the Fermi energy ep and the number density n of the conduction electrons in the sodium 
metal. Compare the latter result with the number density of atoms (given that, for sodium, 

p = 0.954gcm~ 3 andM = 23). 

8.17 Calculate the fraction of the conduction electrons in tungsten (ep = 9.0 eV) at 3000 K whose kinetic 
energy e(= \ mu 2 ) is greater than W (= 13.5 eV). Also calculate the fraction of the electrons whose 
kinetic energy associated with the z-component of their motion, namely (\ mu 2 ) , is greater than 
13.5 eV. 

8.18 Show that the ground-state energy Eq of a relativistic gas of electrons is given by 


E 0 = 


nVm 4 c 5 
3 h 3 


B(x), 


where 


B(x) = 8x 3 {(x 2 + 1) 1/2 - 1) - A(x), 


A(x) and x being given by equations (8.5.13) and (8.5.14). Check that the foregoing result for£o and 
equation (8.5.12) for Pq satisfy the thermodynamic relations 

Eo + PoV = Nfio and Pq = — (9Eo/dV)N. 

8.19 Show that the low-temperature specific heat of the relativisitic Fermi gas, studied in Section 8.5, is 
given by 

Cy _ 2 (x 2 + l) 1 / 2 kT ( X= PF_\ 

Nk x 2 me 2 \ me)' 

Check that this formula gives correct results for the nonrelativistic case as well as for the extreme 
relativistic one. 

8.20 Express the integrals (8.6.19) in terms of the initial slope of the function O(x), and verify 
equation (8.6.20). 

8.21 The total energy E of the electron cloud in an atom can be written as 

E = K + V ne + V ee , 


where K is the kinetic energy of the electrons, V ne the interaction energy between the electrons and 
the nucleus, and V ee the mutual interaction energy of the electrons. Show that, according to the 
Thomas-Fermi model of a neutral atom, 

7 1 

K = —E, V ne = +-E, and V ee = — E, 

so that total V = V ne + V ee = 2E. Note that these results are consistent with the virial theorem; see 
Problem 3.20, with n = — 1. 

8.22 Derive equations (8.4.3) through (8.4.5) for a Fermi gas in a harmonic trap. Evaluate equations 
(8.3.4) and (8.3.5) numerically to reproduce the theoretical curves shown in Figures 8.9 and 8.10. 






Thermodynamics of 
the Early Universe 


Over the course of the twentieth century, astronomers and astrophysicists gathered a vast 
body of evidence that indicates the universe began abruptly 13.75 ± 0.11 billion years ago 
in what became known as the “Big Bang.” 1,2 The intense study of the origin and evolution 
of the universe has led to a convergence of physics and astrophysics. Thermodynamics 
and statistical mechanics play a crucial role in our understanding of the sequence of transi¬ 
tions that the universe went though shortly after the Big Bang. These transitions left behind 
mileposts that astrophysicists have exploited to look back into the earliest moments of the 
universe. The early universe provides particularly good examples for utilizing the proper¬ 
ties of ideal classical, Bose, and Fermi gases developed in Chapters 6, 7, and 8, and the 
theory of chemical equilibrium developed in Section 6.6. 


9.1 Observational evidence of the Big Bang 

Observational evidence of the Big Bang has grown steadily since Edwin Hubble’s discovery 
in the late 1920s that the universe was expanding. Since that time a coherent standard 
model for the beginning of the universe has emerged. The following three items describe 
the key bodies of evidence. 

1. Nearly every galaxy in the universe is moving away from every other galaxy and the 
recessional velocities display an almost linear dependence on the distance between 
galaxies; see Figure 9.1. Hubble was the first to observe this by measuring both the 
distances to nearby galaxies and their velocities relative to our own galaxy. The former 
is based on standard candles, in Hubble’s case Cepheid variable stars with known 
absolute mean luminosity. The latter is based on measurements of the Doppler red 
shift of spectral lines. Type la supernovae are used as the standard candle in the most 
distant observations made using the Hubble Space Telescope. The data are 


'For excellent overviews and history of the study of the Big Bang, see The First Three Minutes: A Modern View of the 
Origin of the Universe by Weinberg (1993) and The Big Bang by Singh (2005). Cosmology by Weinberg (2008) provides 
an excellent technical survey. The organization of this chapter is based on Weinberg (1993). The 2010 decadal survey 
of astrophysics New Worlds, New Horizons in Astronomy and Astrophysics by the National Academies Press provides an 
overview of the current state of the field; see www.nap.edu. 

2 Steady state cosmology advocate Fred Hoyle coined the term "Big Bang” derisively in a BBC radio broadcast in 1950. 
To his eternal dismay, the name quickly became popular. 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00009-8 
© 2011 Elsevier Ltd. All rights reserved. 
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FIGURE 9.1 Hubble diagram of the spectral red-shift velocity of relatively nearby galaxies versus their distance 
using several astronomical standard candles. The velocity is in km/s and the distance is measured in megaparsecs, 
where 1 Mpc = 3.26 x 10 6 light-years. The best fit to the data gives a value of the Hubble parameter as Ho = 
72±8kms -1 Mpc -1 . The Hubble parameter has been recently updated by Riess et al. (2009) to give Ho = 
74.2±3.6kms -1 Mpc -1 . The figure is from Freedman et al. (2001) and is reproduced by permission of the AAS. 


encapsulated in the Hubble-Friedmann relation (Friedmann, 1922, 1924), 



( 1 ) 


Here a represents the distance between any two points in space that grows with time 
as the universe expands, v is the recessional velocity, G is the universal constant of 
gravitation, c is the speed of light, and u is the energy density of the universe. 3 The 
Hubble parameter, H, is the characteristic expansion rate and is of the order of the 
inverse of the age of the universe. The particular form of equation (1) assumes, as 
appears to be the case, that the energy density u is equal to the critical value so that the 

3 Cosmological and general relativistic calculations are usually expressed in terms of the equivalent mass density 
p = u/c 2 = r 00 where T is the energy-momentum tensor. Astrophysicists usually describe the length scale parameter a 
in terms of the Doppler shift factor z = (A/hab — 1), where q a b is the laboratory wavelength of a spectral line and X is the 
red-shifted value. This gives z= (T/To - 1) where To is the current cosmic microwave background temperature and T is 
the photon temperature of that era. For example, the Doppler shift from the era of last scattering is 


z — 



so the universe has expanded by a factor of 1100 since that time. 
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space-time is flat. This means that the universe is balanced on a knife edge between 
expanding forever and recollapsing due to gravity. For excellent technical surveys, see 
Borner (2003) and Weinberg (2008). The measured value of the Hubble parameter is 

Hq = 74.2 ± 3.6 km s' 1 Mpc“\ (2) 

where Mpc is a megaparsec, about 3.26 x 10 6 light-years; see Freedman et al. (2001) 
and Riess et al. (2009). 

2. Penzias and Wilson (1965) observed a nearly uniform and isotropically distributed 
microwave radiation noise coming from deep space with a blackbody temperature of 
about 3 K. This cosmic microwave background (CMB) was quickly identified as the 
remnant blackbody radiation from the era following the Big Bang. Later, balloon 
experiments and space-based measurements by the Cosmic Background Explorer 
(COBE) NASA mission showed that the CMB is extremely uniform and isotropic with 
an average temperature of 7 cmb = 2.725 ± 0.002K; see Mather et al. (1994, 1999), 
Wright et al. (1994), Fixsen et al. (1996), and Figure 9.2. The NASA Wilkinson 
Microwave Anisotropy Probe (WMAP) mission mapped the angular variation of the 
CMB temperature. Figure 9.3 shows the ±200 /zK CMB temperature variations mapped 
onto galactic coordinates. 

The CMB represents the photons that were in thermal equilibrium with the 
high-temperature plasma that existed from the very first moments of the universe 
until it cooled down to approximately 3000 K about 380,000 years after the Big Bang. 



FIGURE 9.2 Cosmic microwave background spectrum from COBE fit numerically to the Planck distribution with an 
average temperature T = 2.725 ± 0.002K; see equations (7.3.8) and (7.3.9), and Figure 7.13. The error bars at the 43 
equally spaced frequencies from the Far Infrared Absolute Spectrophotometer (FIRAS) data are too small to be seen 
on this scale. Figure courtesy of NASA. 
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FIGURE 9.3 Measurement of temperature variations in the CMB using 7 years of data from WMAP. This shows the 
distribution of the CMB blackbody temperature mapped onto galactic coordinates. The variations represent 
temperature fluctuations of ±200/tK. Figure courtesy of NASA and the WMAP Science Team. 


As the temperature fell below 3000 K, the electrons and protons in the plasma 
combined for the first time into neutral hydrogen atoms, a period that is known rather 
oxymoronically as the era of recombination. After this era of “last scattering’’ of 
photons by free electrons, the quantum structure of the atoms prevented them from 
absorbing radiation except at their narrow spectral frequencies, so the universe 
became transparent and the blackbody radiation quickly fell out of equilibrium with 
the neutral atoms. As the universe continued to expand, the wavelengths of the 
blackbody radiation grew linearly with the expansion scale of the universe a. The 
photon number density fell as a ~ 3 and the energy density as a~ A , so the Planck 
distribution, equations (7.3.8) and (7.3.9), was preserved with a blackbody 
temperature that scaled as 


T(t)a(t) = const. (3) 

Measurements of the Hubble parameter and the COBE and WMAP measurements 
of the temperature and temperature fluctuations of the CMB allow a determination of 
the current total energy density of the universe and its composition. The current 
energy density of the universe is 


u = 


3c 2 Hq 


= 8.36 x 10 -10 JrrT 3 , 


8t rG 


(4) 
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and is comprised of approximately 72.8 percent dark energy, 22.7 percent dark matter, 
and 4.56 percent baryonic matter (protons and neutrons]. 4 This gives a baryon 
number density rig of 0.26 m -3 . The number density of photons in a blackbody 
enclosure as a function of temperature is given by equation (7.3.23): 



(5) 


At the current temperature of 2.725 K, this gives a CMB photon number density of 
tiy = 4.10 x 10 8 m -3 , so the current baryon-to-photon ratio is 



( 6 ) 


The ratio rj has remained constant as the universe has expanded since both these 
quantities scale as a -3 (f). As we will see later, the numerical value of r) played a very 
important role in the thermal evolution of the early universe. 5 
3. The relative abundances of the light elements 1 H, 2 H, 3 He, 4 He, 7 Li, and so on 
created during the first few minutes of the universe are sensitive functions of the 
baryon-to-photon ratio iy, see Figure 9.4. This connection was first explored by George 
Gamow, Ralph Alpher, and Robert Herman in the late 1940s and early 1950s; see 
Alpher, Bethe, and Gamow (1948), and Alpher and Herman (1948, 1950). 6 

4 The energy content of the universe is parameterized in terms of the fraction of the critical density contained in the 
various constituents. The current values are dark energy: Qa = 0.728 ± 0.016; baryonic matter: Qf, = 0.0456 ± 0.0016; and 
cold dark matter: Q c = 0.227 ± 0.014. The current age of the universe, or lookback time, is to = 13.75 ± 0.11 x 10 9 years. 
The relative contribution from blackbody radiation is about 6 x 10 -5 . These concordance values of the parameters are 
based on WMAP 7-year data and are tabulated in Komatsu et al. (2010). The dark energy is responsible for the accelerating 
expansion of the universe. The energy density proportions were vastly different in the early universe because they scale 
differently with the expansion parameter a. At the time of recombination, the proportions were: dark matter 63 percent, 
baryonic matter 12 percent, relativistic radiation (photons and neutrinos) 25 percent. During the first few moments, 
relativistic particles provided the dominant contribution to the energy. Using the photon:neutrino:electron ratios of 2 : 
21/4 : 7/2 from Table 9.2, the energy content was photons 18.6 percent, neutrinos and antineutrinos 48.8 percent, and 
electrons and positrons 32.5 percent. While dark energy is currently the dominant contribution to the energy density 
of the universe, it played only a small role in the early evolution of the universe. Cold dark matter was crucial for the 
development of the first stars and galaxies at the end of the “dark ages” 100 to 200 million years after the last scattering. 

5 The proper measure here is the ratio of the baryon number density to photon entropy density but, since the CMB 
photon entropy density and number density both scale as T 3 , the ratio is usually quoted in terms of the ratio of the 
number densities. 

6 George Gamow and Ralph Alpher in 1948, and Alpher and Robert Herman in 1950, proposed a model for nucleo¬ 

synthesis in a hot, expanding primordial soup of protons, neutrons, and electrons. Alpher and Gamow called this 

material “ylem.” To account for the present abundance of 4 He in the universe, Alpher and Herman (1950) proposed 

a baryon-to-photon ratio of roughly 10 -9 and predicted a current cosmic microwave background temperature of about 
5K. Gamow added his friend Hans Bethe’s name as second author to Alpher, Bethe, and Gamow (1948) as a pun on 
the Greek alphabet. The paper was published, perhaps not coincidentally, on April 1; see Alpher and Herman (2001), 
Weinberg (1993), and Singh (2005). 
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FIGURE 9.4 Calculated primordial abundances of light elements ( 4 He, D= 2 H, 3 He, and 7 Li) as functions of the 
baryon-to-photon ratio. The baryon-to-photon ratio is given by rj = 2.7 x 10 ~ B Q/,h 2 , where h is the Hubble 
parameter in units of 100(km/s)/Mpc and Q/, = 0.046 is the current baryonic fraction of the mass-energy density of 
the universe; see Copi, Schramm, and Turner (1997), Schramm and Turner (1998), and Steigman (2006). The 
experimentally allowed range is in the grey vertical bands. Figure from Schramm and Turner (1998). Reprinted with 
permission; copyright © 1998, American Physical Society. 


9.2 Evolution of the temperature of the universe 

As the universe expanded and cooled, the cooling rate was proportional to the Hubble 
parameter, that is, of the order of the inverse of the age of the universe at that point in 
its expansion. This led to a sequence of important events when different particles and 
interactions fell out of equilibrium with the gas of blackbody photons. The neutrinos and 
neutron-proton conversion reactions fell out of equilibrium at t 1 second. Nuclear reac¬ 
tions that formed light nuclei fell out of equilibrium at t 3 minutes. Neutral atoms fell 
out of equilibrium at t ^ 380,000years. All these degrees of freedom froze out when the 
reaction rates that had kept them in equilibrium with the blackbody photons fell far below 
the cooling rate of the expanding universe. Each component that fell out of equilibrium 
left behind a marker of the properties of the universe characteristic of that era. It is these 
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markers that provide evidence of the properties and behavior of the universe during its 
earliest moments. 

From the first moments of the universe up until the recombination era 380,000 years 
later, the cosmic plasma was in thermal equilibrium with the blackbody radiation through 
Thomson scattering. Due to the high density of charged particles, the photon scattering 
mean free time was much shorter than the time scale for temperature changes of the uni¬ 
verse as it expanded and cooled, which kept the plasma in thermal equilibrium with the 
photons. For the first few hundred thousand years of its expansion, the energy density of 
the universe was dominated by photons and other relativistic particles. This is because the 
energy density of the blackbody radiation scales as a ~ 4 whereas the energy density of non- 
relativistic matter scales as a -3 . The temperature of the blackbody photons as a function 
of the age of the universe is shown in Figure 9.5 and Table 9.1. 

During the first one-hundredth of a second, the universe expanded and cooled from its 
singular beginning to a temperature of about 10 11 K. The physics from this time onward 
was controlled by the weak and electromagnetic interactions. The strong interactions 
could be ignored since the baryon-to-photon ratio was so small and the temperature was 



FIGURE 9.5 Sketch of the photon temperature versus the age of the universe. At early times, radiation dominated 
the energy density, so T ~ t -1 / 2 . At later times (f > 10 13 s) nonrelativistic matter dominated the energy density, so 
T ~ f -2 / 3 . In the current dark energy dominated stage, the universe is beginning to expand exponentially with 
time, so the photon temperature is beginning to fall exponentially. 
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Table 9.1 Temperature 
vs. Age of the Universe 


Time (s) 

Temperature (K) 

0.01 

1 x 10 11 

0.1 

3 x 10 10 

1.0 

X 

o 

o 

12.7 

3 x 10 9 

168 

1 x 10 9 

1980 

3 x 10 8 

1.78 x 10 4 

10 8 

1.20 x 10 13 

3000 

4.34 x 10 17 

2.725 


Source : Weinberg (2008). 


too low to create additional hadrons. 7,8 We will follow the thermodynamic behavior of the 
universe from t — 0.01 second when the temperature was 10 11 K to t = 380,000 years when 
the temperature fell below 3000 K. At that point neutral atoms formed, photon scattering 
ended, and the universe became transparent to radiation. After recombination and last 
scattering there were no new sources of radiation in the universe since the baryonic mat¬ 
ter consisted entirely of neutral atoms. This state of affairs lasted until atoms were first 
reionized by the gravitational clumping that formed the first stars and galaxies 100 to 200 
million years after the Big Bang. This reionization epoch ended the so-called cosmic dark 
ages. 


9.3 Relativistic electrons, positrons, and neutrinos 

During the earliest moments of the universe, the temperature was high enough to cre¬ 
ate several kinds of relativistic particles and antiparticles. If kT » me 2 , then particle- 
antiparticle pairs each with mass m can be created from photon-photon interactions. 
At these temperatures, almost all of the particles that are created will have an energy- 
momentum relation described by the relativistic limit, namely 

gfc ~ hek, (1) 

7 Before time f = 0.01s, the analysis is more difficult due to the production of strongly interacting particles and 
antiparticles. At even earlier times, when the temperature was above kT 300 MeV (T = 4x 10 12 K), hadrons would 
have broken apart into a strongly interacting relativistic quark-gluon plasma. The Relativistic Heavy Ion Collider at 
Brookhaven National Laboratory has succeeded in creating a quark-gluon plasma with the highest temperature matter 
ever created in the laboratory, T = 4 x 10 1Z K; see Adare et al. (2010). 

s The exact mechanism for baryogenesis (i.e., nonzero baryon-to-photon ratio r?) is unsettled. It requires, as shown by 
Sakharov (1967), three things: baryon number nonconservation, C and CP violation, and deviation from equilibrium. All 
these conditions were satisfied in the earliest moments of the universe (far earlier than the time scales we examine here) 
but a consensus theory that allows for a baryon asymmetry nearly as large as the observed value of ?; = 6 x 10 -I ° has not 
yet emerged. 
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where hk is the magnitude of the momentum. This relation applies to photons, neutrinos, 
antineutrinos, electrons, and positrons. The threshold for electron-positron pair forma¬ 
tion is m e c 2 /k = 5.9 x i0 9 K. Neutrinos are very light, so we can safely assume that they 
are relativistic. 9 The relativistic dispersion relation 1 gives essentially the same density of 
states for all species of relativistic particles: 

°° 2 

m hck)dk - 2iw (2) 

0 


where g s is the spin degeneracy. Photons have a spin degeneracy g s = 2 (left and right cir¬ 
cularly polarized). The other species are all spin- f fermions. Electrons and positrons have 
spin degeneracy g s = 2 while neutrinos and antineutrinos have spin degeneracy g s — I 
since all neutrinos have left-handed helicity. 

During this era, because of the charge neutrality of the universe and the small size of the 
baryon-to-photon ratio rj, the number densities of the electrons and positrons were nearly 
equal, so their chemical potentials were both rather small. Assuming that the net lepton 
number of the universe is also small, the same applies to the neutrinos and antineutrinos. 
As explained in Section 7.3, the chemical potential for photons is exactly zero. The pres¬ 
sure, number density, energy density, and entropy density of a relativistic gas of fermions 
(+) or bosons (—) with zero chemical potential are are given by 


P(T) 


n(T) 


u(T) 


s(T ) 



(3a) 


(3b) 


(3c) 


(3d) 


9 A11 three neutrino families are known to have small (but nonzero) mass from neutrino oscillation observations. 
The electron neutrino is probably the lightest with the experimental limit of m Ve c 2 < 2.2 eV. The distribution of angular 
fluctuations of the CMB measured by WMAP puts a limit on the sum of the masses of the neutrinos, "^gm v cr < 0.58eV, 
so we can safely assume that all neutrino species are far lighter than the value of kT during the early universe. 
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Using the values of the Bose integrals from Appendix D, we arrive at the following 
expressions for the blackbody photons: 


P y (T) = 


n 2 ( kT) 4 
45 (he) 3 ’ 


n y (T) 


2f(3) / kT\ 3 
n 2 \ he) 


Uy(T) 


Tt 2 (kT) 4 
15 (he) 3 ’ 


Sy(T) = 


4n 2 k 

45 



(4a) 

(4b) 

(4c) 

(4d) 


All relativistic species with p — 0 have the same power law temperature dependences for 
the pressure, energy density, and so on, as the photons, while the Fermi and Bose integrals 
are the same except for a constant prefactor: 


/ 


x n ~ l 

e x +l 


o 


dx = 




x ”- 1 
e x —1 


dx; 


(5) 


see Appendices D and E. The contributions to the pressure, energy density, and entropy 
density result from counting the spin degeneracies, the number of particles and antiparti¬ 
cles, and accounting for the different Fermi/Bose factors (1 for bosons, 7/8 for fermions). 
The photons, three generations of neutrinos (electron, muon, and tau neutrinos and their 
antiparticles), and the electrons and positrons contribute to the total pressure, number 
density, energy density, and entropy density in the proportions shown in Table 9.2. The 
counting is presented here, as is usually done in the literature, relative to the contribu¬ 
tion per spin state of the photons. The contributions to the number densities are the same 
except that the Fermi/Bose factor is now 3/4. 


Table 9.2 Relativistic Contributions to Pressure, Energy 
Density, and Entropy Density 


Particles 

Fermi/Bose Factor 

Spin Degeneracy 

Number of Species 

2^0131/^ 

y 

Ve.V/i'Vz 

1 

7 

8 

2 

1 

1 

3 

2 

21 

8 

Ve.Vfi'Vr 

7 

8 

1 

3 

21 

8 

e~ 

7 

8 

2 

1 

7 

4 

e+ 

7 

8 

2 

1 

7 

4 
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The totals then are 

/ 21 7\ P V (T ) /43\ 

T,o,ai(r)=(^2+ T + -J^— = ^-jp Y (T), (6a) 

/ 21 7\ u v (T) /43\ 

M total(T) = ( 2+ — + -J ——— = l — )uy(T), (6b) 

/ 21 7) s y (T) /43\ 

5 total(T) — (2+ — + -j —-— — ( — J Sy(T), (6c) 

w total(T) = |^2+ - + 3J ——— = ^— Jn Y (T). (6d) 

The density of the universe was high enough in this era, so the weak and electromagnetic 
interaction rates kept all these species in thermal equilibrium with one other. Therefore, 
as the universe expanded adiabatically, the entropy in a comoving volume of linear size a 
remained constant as the volume expanded from some initial value a jj to a final volume a \: 

%>tal(To)flo — Stotal(Ti)fli. (7) 

Since the entropy density is proportional to T 3 , the temperature and length scale at time t 
are related by 


T{t)a{t) = const. (8) 

This is the same relation that applies for a freely expanding photon gas, see equa¬ 
tion (9.1.3), buthere it arises from an adiabatic equilibrium process. From equations (9.1.1) 
and (8), the temperature of the universe as a function of the age of the universe t during 
this era is 


T(t) = 10 10 K, 


0.992s 

t 


(9) 


see Problem 9.1. 


9.4 Neutron fraction 

During the first second of the universe, when T > 10 10 K, and before protons and neu¬ 
trons combined into nuclei, the weak interaction kept the free neutrons and protons in 
thermal “beta-equilibrium” with each other and with the photons, neutrinos, electrons, 
and positrons through the processes 

n + v e ^±p + e~ + y, 
n + e + p + v + y, 
n p + e~ + v + y. 


(la) 

(lb) 

(lc) 
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We can treat this as a chemical equilibrium process, as described in Section 6.6. Since the 
chemical potentials of the photons, electrons, positrons, neutrinos, and antineutrinos are 
all zero, the neutron and proton chemical potentials must be equal at equilibrium: 


fT-n — Ap- ( 2 ) 

At these temperatures (s» 10 11 K) and densities (» 10 32 m~ 3 ), the protons and neutrons can 
be treated as a classical nonrelativistic ideal gas. Following equation (6.6.5), the spin-| 
proton and neutron chemical potentials are 

Up = mpC 2 + kT\n(jip>^j — fc7Tn2, (3a) 

pi„ = m n i? + kT In ( n n k^j — kT ln2. (3b) 

where A.(= h/yfZjtmkJ) is the thermal deBroglie wavelength. The rest energy of the 
neutron is greater than the rest energy of the proton by 


m n c 2 — rripC 2 = A e = 1.293MeV. (4) 

Ignoring the small mass difference in the thermal deBroglie wavelength in equations (3a) 
and (3b) gives 


n n = n p e flAe . 


(5) 


The baryon number density is the sum of the neutron and proton number densities 

n B = n n + rip, (6) 


so the equilibrium neutron fraction is given by 


nn _ 1 

n B e^ Ae + 1 ’ 


(7) 


The mass difference gives a crossover temperature T np = As/k ~ 1.50 x 10 10 K, so the 
neutron fraction drops from 46 percent when T = 10 11 K to 16 percent when T = 9 x 10 9 K 
at l] ~ 1 second. As the temperature fell below 10 10 K (fcT = 0.86MeV), the weak interaction 
rate began to fall far below the cooling rate of the universe, so the baryons quickly fell out of 
equilibrium with the neutrinos. From that time onward the neutrons began to beta-decay 
with their natural radioactive decay lifetime of r n — 886 seconds, so the neutron fraction 
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fell exponentially: 


g«0.16exp^ - J for t > t\ = Is. (8) 

By the time of nucleosynthesis, about 3.7 minutes later, the neutron fraction had dropped 
to q » 0.12. At that point, the remaining neutrons bound with protons to form deuterons 
and other light nuclei. For a discussion of nucleosynthesis, see Section 9.7. 


9.5 Annihilation of the positrons and electrons 

About one second after the Big Bang, the temperature approached the crossover tempera¬ 
ture T e for creating electron-positron pairs: 


/cr e = mgC 2 = 0.511 Me V, (1) 

with T e — 5.93 x 10 9 K. As the temperature fell below T e , the rate of creating e + e~ pairs 
began to fall below the rate at which pairs annihilated. The full relativistic dispersion 
relation for electrons is 


Sk = 


yj (Kck) 2 


+ (m e c 2 ) 


2\2 


( 2 ) 


which gives for density of states 

OO . - 

8tt r „ eJs 2 — {m e c 2 ) 2 , ■> 

a e (e)= ——=- / k 2 8(s-e k )dk= - ? , - fore >m e c 2 . (3) 

o 

Since the electrons and positrons were in equilibrium with the blackbody photons via the 
reaction 


e + +e +±y + y, (4) 

the equilibrium equation (6.6.3) implied that the chemical potentials of the species were 
related by 

fl- + ll + =2fly =0. (5) 

The ratio of the number density of the electrons to that of the photons then was 

OO - - 

1 I" xJx 2 — (fim e c 2 ) 2 

— =- / - 5 - dx, 

n y 24t(3) J + i 

ftm e c 2 


(6) 
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while the positron density ratio was 

OO . - 

n+ 1 t Xy/x 2 - {/3m e c 2 ) 2 ^ x . 
hy ~ 2? (3) J eXePr- + l x ' 

pm e c 2 


see equation (9.1.5). The electron and positron densities became unbalanced as the 
universe cooled. 

Eventually all the positrons got annihilated leaving behind the electrons that currently 
remain. Charge neutrality of the universe required the difference between the number 
density of electrons and the number density of positrons to be equal to the number density 
of protons, (1 — q)riB, where q is defined in Section 9.4; hence 


OO -- 

(n_ - n+) _ sinh(^M-) f *V * 2 - Q 3m e c 2 ) 2 _ 

n Y 2f(3) J cosh(x) + cosht/Jjtr-) X ^ ^ 

fim e c 2 


( 8 ) 


We can use equation (8) to determine the electron chemical potential as a function of tem¬ 
perature numerically and then use that value in equations (6) and (7) to determine the 
electron and positron densities; see Figure 9.6. 

Initially, the electron and positron densities both decreased proportional to 
exp(— /3m e c 2 ) as the temperature fell below the electron-positron pair threshold, but they 



FIGURE 9.6 The ratio of the electron and positron densities to the photon density as a function of fSm e c 2 during the 
e + e~ annihilation for rj = 6 x 10~ 10 . This era began around temperature 10 10 K ( ftm e c 2 = 1.7) at time t = 1 second 
and ended when the temperature was about 3 x 10 8 K ( fim e cr — 20) at time t = 33 minutes when the electron 
number density leveled off at the proton number density. 
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remained nearly equal to each other until T & m e c 2 /Adn[l/(l — q)i 7 ] 3 x 10 8 K. At that 
temperature, the electron density began to level off at the proton density while the positron 
density continued to fall. 

Using the baryon-to-photon ratio i; = 6 x 10 _1 °, we infer that during the first second 
of the universe that for every 1.7 billion positrons there must have been one extra elec¬ 
tron. It is these few extra electrons that will combine with nuclei during the recombination 
era composing all the atoms now present in the universe. All baryonic matter currently 
in the universe is the result of this initial asymmetry between matter and antimatter; 
see footnote 8 . 


9.6 Neutrino temperature 


For temperatures above T = 10 10 K, the rates for the weak interaction reactions (9.4.1) kept 
the neutrinos in “beta-equilibrium” with the electrons, positrons, and photons. Starting at 
time t » 1 s, when T = 10 10 K, the weak interaction rates began to fall far below the expan¬ 
sion rate of the universe so the neutrinos quickly fell out of equilibrium. Following the 
decoupling, the neutrinos expanded freely so the neutrino temperature scaled with the 
expansion length scale following equation (9.1.3). 

The system of electrons, positrons, and photons remained in thermal equilibrium with 
each other and expanded adiabatically during the electron-positron annihilation era from 
temperature T 0 = 10 10 K when the annihilations began, until temperature 7\ = 3 x 10 8 K 
when nearly all of the positrons had been annihilated. Since this was an adiabatic expan¬ 
sion, we can determine the temperature evolution using entropy conservation. Consider 
a comoving cubical volume that expanded from an initial linear size a 0 to a final size a\ 
during the same time period. The total entropy in the comoving volume at temperature T 0 
was due to the photons, electrons, and positrons (refer to Table 9.2): 


S(T 0 ) = — s y (T 0 )al 


( 1 ) 


while the entropy at temperature T\ was due solely to the photons since, by then, nearly 
all of the electrons and positrons had been annihilated: 


S(Ti) = Sy(Ti)a\. 


( 2 ) 


Entropy conservation during the adiabatic expansion relates the initial and final tem¬ 
peratures as 



(3) 


In essence, the entropy of the annihilating electrons and positrons was transferred to the 
photons. Since the neutrino and photon temperatures were equal before the electron- 
positron annihilation and the neutrinos expanded freely during the annihilation, the 
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neutrino temperature decreased more than the photon temperature during the annihi¬ 
lation era: 


T vl = (4/11) 1/3 7i. (4) 

After the e + e~ annihilation, both the neutrino and the photon temperatures evolved 
according to equation (9.1.3) and (9.3.8), so the current temperature of the relic Big Bang 
neutrinos should be 


T v = (4/ll) 1/3 r CMB ~ 1.945K. (5) 

A measurement of the cosmic neutrino background would provide an excellent additional 
test of the standard model of the Big Bang but we do not currently have a viable means to 
measure these very low-energy neutrinos. 10 


9.7 Primordial nucleosynthesis 

Light nuclei other than hydrogen first formed between 3 and 4 minutes after the Big Bang 
when the temperature had cooled to about 10 9 K. Prior to that time, the high-temperature 
blackbody radiation rapidly photodissociated any deuterium nuclei that happened to 
form. The first step for the formation of light nuclei from the protons and neutrons is the 
formation of deuterium because all of the rates for forming nuclei at these densities are 
dominated by two-body collisions. Once deuterons formed, most of these nuclei would 
have been quickly converted to helium and other more stable light nuclei in a series of 
two-body collisions with the remaining protons, neutrons, and with each other. As dis¬ 
cussed in Section 9.4, the proton/neutron mixture at this time was about q = 12 percent 
neutrons and 1 - q = 88 percent protons. By t ~ 3 minutes the temperature had fallen to 
T « 10 9 K so protons and neutrons could begin to bind themselves into deuterons via the 
process 


p+nT± d+y. (1) 

The chemical equilibrium relation for this reaction, see equation (6.6.3), is 


dp + dn — ddy ( 2 ) 

since the chemical potential of the blackbody photons is zero. At these temperatures and 
densities the protons, neutrons, and deuterons can be treated as classical ideal gases. 

10 The neutrino elastic scattering cross-section scales like the fourth power of the energy, so the collisions are both 
very rare and involve very small energy and momentum transfers. This makes direct laboratory detection of the cosmic 
neutrino background (Ci’B) infeasible at present; see Gelmini (2005). 
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The proton and neutron are spin- \ particles so they have two spin states each while the 
deuteron is spin-1 and has three spin states: 


/jL p = nipC 2 + kT\n (lipX 2 ^ — kT ln2, 

(3a) 

fi n — m n c z + kT In (ji n X^J — kT ln2, 

(3b) 

Mri = mac 2 + kT\n (n d xfj - kT ln3. 

(3c) 

The binding energy of the deuteron is = m p c 2 + m n c 2 - m d c 2 
deuteron is approximately twice as massive as protons or neutrons, 
density is given by 

= 2.20 MeV. Since the 
the deuteron number 

n d = jitprin-^e^b ss -^-n p n n X 2 e^ e b. 

4 X d V2 ■ 

(4) 

The total number density of baryons is determined by the baryon-to-photon ratio: rj: 
ub = n n Y = n p + n n + 2 n d . The neutron number density is qua — n n + n d , so the deuteron 
fraction is given by 

fd=~ = (1 -c/-fd)(q~fd)s, 
n B 

(5) 

where the parameter s is 


s _ i2f(3) / a ^ 

\j7i \ m p c z ) 

(6) 


see also equation (9.1.5). Equation (5) is similar to the Saha equation for the ionization of 
hydrogen atoms that will be discussed in Section 9.8 and has solution 

1 + s - ^(1 + s) 2 -4s 2 q(l- q) 


For high temperatures, 5 is small and f d % q( 1 — q)s while for low temperatures, 5 is large 
and /rf ~ q, that is, all the neutrons are bound into deuterons. The deuterium fraction as a 
function of temperature is shown in Figure 9.7. The small values of the baryon-to-photon 
ratio 7] and £>,/ iripC 2 delayed the nucleosynthesis until the temperature had fallen to 


kT n - 



(8) 


providing the time for the neutron fraction to have decayed to q — 0.12. 
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FIGURE 9.7 Plot of the equilibrium deuterium fractionversus temperature T for neutron fraction q = 0.12 
and baryon-to-photon ratio ij = 6x 10 -10 . As T falls below about 6 x 10 8 K the neutrons are nearly all bound into 
deuterons. Further two-body reactions convert most of the deuterium into heavier nuclei, primarily 4 He. 


The simple equilibrium calculation presented here assumes that no further reactions 
take place. Including the fast nonequilibrium two-body reactions, namely 


d -\- d — 

■* 3 H + p+y, 

(9a) 

d -\- d — 

■> 3 H e + n + y, 

(9b) 

d+ 3 H- 

-* 4 H e + n + y, 

(9c) 

d+ 3 He - 

-> 4 H e + p+y, 

(9d) 


results in almost all of the deuterons being cooked into the very stable isotope 4 He and 
small amounts of other light nuclei. Since each 4 He nucleus is composed of two protons 
and two neutrons, this gives a helium mass fraction of 2 q — 24 percent and proton mass 
fraction of 1 — 2q — 76 percent. The complete calculation involves nonequilibrium effects 
modeled with rate equations for each of the nuclear interactions, including those for heav¬ 
ier isotopes, but that only changes the predicted concentration for 4 He slightly; 11 see 
Weinberg (2008). The largest theoretical uncertainty is, remarkably, the uncertainty in the 
radioactive decay time of the neutron in equation (9.4.8); see Copi, Schramm, and Turner 
(1997). Calculations of this type were first performed by Gamow, Alpher, and Herman in the 
late 1940s and early 1950s. Based on current amounts of helium and other light elements 

11 Nuclear reactions continued slowly at a rate that had fallen out of equilibrium and shifted the isotopic ratios until 
about t 10 minutes. 
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in the universe, Alpher and Herman predicted a 5 K cosmic microwave background over a 
decade before Penzias and Wilson’s discovery; see footnote 6. 


9.8 Recombination 

After the nucleosynthesis took place in the first few minutes, the universe continued to 
cool, with the nuclei and electrons remaining as an ordinary plasma in thermal equilib¬ 
rium with the photons. It took several hundred thousand years for the temperature to drop 
below the atomic ionization energies of a few electron volts needed for nuclei to capture 
electrons and form atoms. Hydrogen was the last neutral species to form since it has the 
smallest ionization energy of a Rydberg (lRy= m ( .e 4 /8f ( 2 // 2 = 13.6057eV). At first glance, 
one would think that atoms form when the temperature falls below Ry/k — 158,000 K 
but, as we will see, the huge number of photons per proton delayed recombination until 
T « 3000 K. Once all the electrons and protons formed into neutral hydrogen atoms, the 
universe became transparent due to the last scattering of radiation from free electrons. 
These CMB blackbody photons were suddenly free to propagate and hence have been 
traveling unscattered since that time. 

The recombination reaction (that is, the inverse of the hydrogen photoionization 
reaction) is 


p + e^U + y, (1) 

so the chemical equilibrium relation from Section 6.6 gives 


Mp + Me — MH (2) 

since, again, the chemical potential of the blackbody photons is zero. 12 At the temperatures 
and densities prevailing during this era (a few thousand degrees Kelvin and only about 10 9 
atoms per cubic meter), the electrons, protons, and hydrogen atoms can all be treated as 
classical ideal gases, with the result 


p p = m p c 2 + kTln(ripXp) — kT In 2, 

(3a) 

p e = m e c 2 + kTln(n e xl) — fc2Tn2, 

(3b) 

pn = mnc 2 + kTlninftXfyi — kT ln4. 

(3c) 


The binding energy of hydrogen is m p c 2 + m e c 2 — ni\\c 2 = 1 Ry. The equilibrium condition 
(6.6.3) and the ideal gas chemical potential (6.6.5) then give a simple relation between the 
number densities of the three species: 

«u = n p n e X 2 e pRy , (4) 

12 The same reaction occured for the deuterons that remained after nucleosynthesis at t 3 minutes but the density 
of the deuterons was 3 x 10 -5 times the proton density; refer to Figure 9.4. 
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where 


k e = , (5) 

•J2nm e kT 

is the electron thermal deBroglie wavelength. The number densities of free electrons and 
protons are the same due to charge neutrality: 

n e = n p . (6) 

The protons remaining after nucleosynthesis are either free or combined into hydrogen 
atoms, so 


«p + «h = (1-2 q)n B = (1 -2q)rin y . 


(7) 


Putting equations (3), (4), (6), and (7) together and making use of (9.1.5) gives the Saha 
equation for the neutral hydrogen fraction: 


/h = 


tlH 

rip + «h 


(1 -/h) 2 s, 


( 8 ) 


where the parameter 5 is 


s = 4?(3y-(1-2*7)17 
jt 


kT 
m e c 2 


3/2 


^ R y 


(9) 


The solution to equation (8), namely 


/h = 


1 + 2s - Vl + 4s 
2s 


( 10 ) 


is shown in Figure 9.8. At temperatures above the recombination temperature, s is small so 
/h is small, making the plasma fully ionized. At low temperatures s is large so/h approaches 
unity, leaving just neutral atoms. The small values of the baryon-to-photon ratio rj and 
Ry /m e c 2 make the onset of recombination at temperature 


kT r 



(11) 


which delays the last scattering until T ~ 3000 K; see Figure 9.8. 
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T (kelvin) 


FIGURE 9.8 The equilibrium neutral hydrogen fraction as a function of temperature for baryon-to-phonon ratio 
rj = 6 x 10 -10 and proton fraction 1-2 q = 0.76. By the time temperature T = 3000K, 99.5 percent of the free 
protons and electrons had combined into neutral hydrogen resulting in "last scattering" and the universe became 
transparent. The age of the universe at that time was about 380,000 years. 


9.9 Epilogue 

The formation of neutral atoms about 380,000 years after the Big Bang effectively 
ended the scattering of photons from free charges. The universe became transparent and 
entered the “dark ages” before the first star formation. The CMB photons were no longer 
in equilibrium but maintained their Planck distribution as the universe expanded. Small 
density fluctuations that were present in the electron-proton plasma just before recom¬ 
bination were imprinted on the CMB as temperature fluctuations. The CMB shown in 
Figure 9.3 earlier displays temperature fluctuations of the order of ±200 /iK that represent 
the density fluctuations in the plasma at the time of recombination. These small mass 
density fluctuations led to gravitational clumping that resulted in the formation of the first 
stars and galaxies 100 to 200 million years after the Big Bang. The large fraction of nonbary- 
onic cold dark matter was crucial in this process. Early stars that exploded as supernovae 
spewed their heavy elements (carbon, oxygen, silicon, iron, gold, uranium, etc.) into the 
cosmos. Our own solar system formed from a gas and dust cloud that included heavy 
elements that had been created in an earlier supernova event. Indeed, “we are stardust.” 13 

13 Joni Mitchell, Woodstock ; copyright © Siquomb Publishing Company: 

“We are stardust 
Billion year old carbon 
We are golden 

Caught in the devil’s bargain 
And we’ve got to get ourselves 
back to the garden” 
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Problems 

9.1. Use the Hubble expansion relation (9.1.1), the temperature scaling relation (9.1.3), and the energy 
density relation before the electron-positron annihilation (9.3.6b) to show that the temperature as a 

function of time during the first second of the universe was T(t) « 10 10 K J 09 ^ 2s . 

9.2. Determine the average energy per particle and average entropy per particle for the photons, 
electrons, positrons and neutrinos during the first second of the universe. 

9.3. While the electromagnetic interaction between the photons and the charged electrons and 
positrons kept them in equilibrium with each other during the early universe, show that the direct 
electromagnetic Coulomb interaction energy between the electrons and positrons was small 
compared to the relativistic kinetic energy of those species. Show that the ratio between the 
Coulomb and kinetic energies is of the order of the fine structure constant: 

^coulomb ^ _ 1 

u e 47reoftc 137.036 

9.4. Show that during the early part of the electron-positron annihilation era, the ratio of the electron 
number density to the photon number density scaled with temperature as 



9.5. Show that after nearly all of the positrons were annihilated and the electron number density had 
nearly leveled off at the proton density, the ratio of the positron number density to the photon 
number density scaled with temperature as 



9.6. After the positrons were annihilated, the energy density of the universe was dominated by the 
photons and the neutrinos. Show that the energy density in that era was: u total = (1 + (4/ll) 4 / 3 )u y . 
Next, use the Hubble expansion relation (9.1.1), the temperature scaling relation (9.1.3), and the 
energy density after the electron-positron annihilation to show that the photon temperature as a 

function of time was T ( t ) ~ 10 10 K J L7 t 88s . This relation held from t ~ 100 s until t 200,000 years 
when the energy density due to baryonic and cold dark matter began to dominate. 

9.7. How would the primordial helium content of the universe have been affected if the present cosmic 
background radiation temperature was 27K instead of 2.7K? What about 0.27 K? 

9.8. Gold-on-gold nuclear collisions at the Relativistic Heavy Ion Collider (RHIC) at the Brookhaven 
National Laboratory create a quark-gluon plasma with an energy density of about 4GeV/fm 3 ; see 
Adare et al. (2010). Treat nuclear matter as composed of a noninteracting relativistic gas of quarks 
and gluons. Include the low-mass up and down quarks and their antiparticles (all spin-4), and 
spin-1 massless gluons. Like photons, the gluons are bosons, have two spin states each, and are their 
own antiparticle. There are eight varieties of gluons that change the three color states of the quarks. 
Only the strongly interacting particles need to be considered due to the tiny size of the plasmas. 
What is the temperature of the quark-gluon plasma? 

9.9. Calculate the energy density versus temperature very early in the universe when the tempera¬ 
tures were above kT = 300 MeV. At those temperatures, quarks and gluons were released from 
individual nuclei. Treat the quark-gluon plasma as a noninteracting relativistic gas. At those 
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temperatures, the species that are in equilibrium with one other are: photons, the three neutrino 
species, electrons and positrons, muons and antimuons, up and down quarks and their antiparticles 
(all spin-|), and spin-1 massless gluons. Like photons, the gluons are bosons, have two spin states 
each, and are their own antiparticle. There are eight varieties of gluons that change the three color 
states of the quarks. The strange, charm, top, and bottom quarks and tau leptons are heavier than 
300 MeV, so they do not contribute substantially at this temperature. Use your result and equation 
(9.1.1) to determine the temperature evolution as a function of the age of the universe during this 
era and its age when IcT ~ 300 MeV. 


Statistical Mechanics of Interacting 

Systems: The Method of 
Cluster Expansions 


All the systems considered in the previous chapters were composed of, or could be 
regarded as composed of, noninteracting entities. Consequently, the results obtained, 
though of considerable intrinsic importance, may have limitations when applied to sys¬ 
tems that actually exist in nature. For a real contact between the theory and experiment, 
one must take into account the interparticle interactions operating in the system. This 
can be done with the help of the formalism developed in Chapters 3 through 5 which, 
in principle, can be applied to an unlimited variety of physical systems and problems; 
in practice, however, one encounters in most cases serious difficulties of analysis. These 
difficulties are less stringent in the case of systems such as low-density gases, for which 
a corresponding noninteracting system can serve as an approximation. The mathema¬ 
tical expressions for the various physical quantities pertaining to such a system can be 
written in the form of series expansions, whose main terms describe the correspond¬ 
ing ideal-system results while the subsequent terms provide corrections arising from the 
interparticle interactions in the system. A systematic method of carrying out such expan¬ 
sions, in the case of real gases obeying classical statistics, was developed by Mayer and his 
collaborators (1937 onward) and is known as the method of cluster expansions. Its gener¬ 
alization, which equally well applies to gases obeying quantum statistics, was initiated by 
Kahn and Uhlenbeck (1938) and was perfected by Lee and Yang (1959a,b; 1960a,b,c). 


10.1 Cluster expansion for a classical gas 


We start with a relatively simple physical system, namely a single-component, classical, 
monatomic gas whose potential energy is given by a sum of two-particle interactions uy. 
The Hamiltonian of the system is then given by 



( 1 ) 


the summation in the second part goes over all the N(N— l)/2 pairs of particles in the 
system. In general, the potential uy is a function of the relative position vector r,y(= ry — r,); 
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however, if the two-body force is a central one, then the function uy depends only on the 
interparticle distance r^. 

With the preceding Hamiltonian, the partition function of the system is given by, see 
equation (3.5.5), 


QN(V ’ Ti N \ }pN / 


exp 


-P E (2^P?) - P E u a am P d 

7 i<j J 


*3N r 


( 2 ) 


Integration over the momenta of the particles can be carried out straightforwardly, with 
the result 


Qn(V,T) = 


1 

N\X 3N 


/ 


exp 


i<j 


j3N 


r= ma n Z n{V ’ T) ’ 


(3) 


where ),{— h/(2nmkT) l l 2 } is the mean thermal wavelength of the particles, while the 
function Z N (V, T) stands for the integral over the space coordinates r\, r 2 ,..., />: 


Z N (V, T) = / exp 




-kY.‘ 

i<j 


j3N 


-/n- 

i<] 


(e P Ui i)d 3N r. 


(4) 


The function Zy ( V, T ) is generally referred to as the configuration integral of the system. 
For a gas of noninteracting particles, the integrand in (4) is unity; we then have 

yiV 

Z$\v,T) = V n and Q^\v,T)=— m> (5) 

in agreement with our earlier result (3.5.9). 

To treat the nonideal case we introduce, after Mayer, the two-particle function fj, 
defined by the relationship 


fij = e- pu ‘j - 1. (6) 

In the absence of interactions, the function^ is identically zero; in the presence of interac¬ 
tions, it is nonzero but at sufficiently high temperatures it is quite small in comparison with 
unity. We, therefore, expect that the functions fj would be quite appropriate for carrying 
out a high-temperature expansion of the integrand in (4). 

A typical plot of the functions Uy and fj is shown in Figure 10.1; we note that (i) the 
function fj is everywhere bounded and (ii) it becomes negligibly small as the interparticle 
distance r,y becomes large in comparison with the “effective” range, ro, of the potential. 
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FIGURE 10.1 A typical plot of the two-body potential function Uy and the corresponding Mayer function/jy. 


Now, to evaluate the configuration integral (4), we expand its integrand in ascending 
powers of the functions^-: 


Z n (V,T)= I ]~[(1 +fij)d 3 ri ■■■d 3 r N 

i<i 

= f [ 1 + J2f‘j + Jlfijfkl + ■ ■ • ] d 3 rt • • • d 3 r N . (7) 

A convenient way of enumerating the various terms in (7) is to associate each term with a 
corresponding Al-particle graph. For instance, if N were 8, the terms 

tA= //34/68 d 3 ri---d 3 r 8 and t B = fufuf67d 3 n--d 3 r 8 (8) 


in the expansion of the configuration integral Z 8 could be associated with the 8-particle 
graphs 


© © © ® 
© © (©©) 



(9) 


respectively. A closer look at the terms tA and t B (and at the corresponding graphs) suggests 
that we better regard these terms as suitably factorized (and the graphs correspondingly 
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decomposed), that is, 


= J £r ') 

f d 3 r 2 j 

f d 3 r 5 j 

f d 3 r 7 J 

f f34d 3 r 3 d 3 r 4 j 

f f 68 d 3 

r 6 d 3 r 8 

1 

+©] • [©] ■ [©] ■ [0 • ©-®] ■ ®-®1 1 


(10) 


and similarly 


t/i = 


J d 3 r 3 J d 3 r 5 J d 3 r 8 J fi 2 fud 3 rid 3 r 2 d 3 r 4 J f 67 d 3 r 6 d 3 r 7 


[®].[©].[®].[®-(t)] 



(it) 


We may then say that the term t^ in the expansion of the integral Z 8 represents a 
“configuration” in which there are four “clusters” of one particle each and two “clusters" of 
two particles each, while the term fg represents a “configuration” in which there are three 
“clusters” of one particle each, one “cluster” of two particles and one “cluster” of three 
particles. 

In view of this, we may introduce the notion of an N-particle graph which, by definition, 
is a “collection of N distinct circles, numbered 1,2,... ,N, with a number of lines linking 
some (or all) of the circles”; if the distinct pairs (of circles), which are linked through these 
lines, are denoted by the symbols a, ft,...,k (each of these symbols denoting a distinct pair 
of indices out of the set 1,2,..., N), then the graph represents the term 

J ( fafp ■ ■ -fr)d 3 ri ■ ■ ■ d 3 r N (12) 

of expansion (7). A graph having the same number of linked pairs as this one but with the 
set (a', (}’ ,..., k') distinct from the set (a, fi,.. will be counted as a distinct graph, for it 
represents a different term in the expansion; of course, these terms will belong to one and 
the same group in the expansion. Now, in view of the one-to-one correspondence between 
the various terms in the expansion (7) and the various /V-particle graphs, we have 

Zjv( V, T) — sum of all distinct AT-particle graphs. (13) 

Further, in view of the possible factorization of the various terms (or the possible decom¬ 
position of the various graphs), we may introduce the notion of an l-cluster which, by 
definition, is an “Z-particle graph in which each of the l circles, numbered 1,2,...,/, is 
directly or indirectly linked with every other circle.” As an example, we write here a 
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5-particle graph, which is also a 5-cluster: 



J fufuf 15 / 25/34 d 3 ri - ■■ d 3 r5. 


(14) 


It is obvious that a cluster as such cannot be decomposed into simpler graphs inasmuch 
as the corresponding term cannot be factorized into simpler terms. Furthermore, a group 
of / particles (except when / = 1 or 2) can lead to a variety of /-clusters, some of which may 
be equal in value; for instance, a group of three particles leads to four different 3-clusters, 
namely 



(15) 


of which the first three are equal in value. In view of the variety of ways in which an /-cluster 
can appear, we may introduce the notion of a cluster integral bj, defined by 


bi(V,T) = 


1 

I\x3(l-I)v 


x (the sum of all possible /-clusters). 


(16) 


So defined, the cluster integral bi(V, T) is dimensionless and, in the limit V 00 , 
approaches a finite value, b/T), which is independent of the size and the shape of the 
container (unless the latter is unduly abnormal). The first property is quite obvious. The 
second one follows by noting that if we hold one of the / particles fixed, at the point n 
say, and carry out integration over the coordinates of the remaining (/ — 1) particles, then, 
because of the fact that the functions fy extend only over a small finite range of distances, 
this integration would extend only over a limited region of the space available — a region 
whose linear dimensions are of the order of the range of the functions fij; 1 the result of 
this integration will be practically independent of the volume of the container. 2 Finally, we 
integrate over the coordinates n of the particle that was held fixed and obtain a straight 
factor of V; this cancels out the V in the denominator of the defining formula (16). Thus, 
the dependence of the cluster integral b/(V, T) on the size of the container is no more 
than a mere “surface effect’’ — an effect that disappears as V -»• 00 , and we end up with 
a volume-independent number bi(T). 

'Hence the name “cluster.” 

2 Of course, some dependence on the geometry of the container will indeed arise if the fixed particle happened to be 
close to the walls of the container. This is, however, unimportant when V ->• oo. 
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Some of the simpler cluster integrals are 

b ‘-U®l-rj d3r '~ h 

te= 23 d©-® 

oo 

~ J fnd^ru = jj J f(r)r 2 dr ( 17 ) 

0 

oo 

= ^ J (e~ u(r)/kT - l)r 2 dr, ( 18 ) 

o 

i >3 = —-=— x [sum of the clusters (15)] 

6X e V 

= 6 ^g y J ( /12/13 +/12/23 +/13/23 +fi2fi3f23)d 3 rid 3 r2d 3 r 3 

~ [^ 3l/ JJ fi2fi3d 3 r\ 2 d 3 ri3 + V Jj fi2fuf23d 3 ri2d 3 ru 

= JJ f]2f\?,f2?,d 3 r\2d 3 r\o i , (19) 

and so on. 

We now proceed to evaluate the crucial expression in (13). Obviously, an IV-particle 
graph will consist of a number of clusters of which, say, mi are 1 -clusters, m 2 are 2 -clusters, 
m 3 are 3-clusters, and so on; the numbers {m/} must satisfy the restrictive condition 

N 

J2 lm l=N, mi = 0,1,2,.. ,,N. (20) 

1=1 


However, a given set of numbers {mi} does not specify a unique, single graph; it represents 
a “collection of graphs” the sum total of which may be denoted by the symbol S{m/}. We 
may then write 


Z N (V,T) = J2s{mi}, ( 21 ) 

[mi) 

where the primed summation J2' goes over all sets [m.;| that conform to the restrictive 
condition (20). Equation (21) represents a systematic regrouping of the graphs, as opposed 
to the simple-minded grouping that first appeared in equation (7). 
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Our next task consists of evaluating the sum S{m/}. To do this, we observe that the 
“family of graphs” under the distribution set {m/} arises essentially from the following two 
causes: 

(i) there are, in general, many different ways of assigning the N particles of the system to 
the J2i m i clusters, and 

(i i) for any given assignment, there are, in general, many different ways of forming the 
various clusters, for even with a given group of l particles an /-cluster (if l > 2) can be 
formed in a number of different ways; see, for example, the four different ways of 
forming a 3-cluster with a given group of three particles, as listed in (15). 

For cause (i), we obtain a straightforward factor of 

N\ N\ 

-=-■ 22 ) 

(l!) m i(2!) m 2.. Y\(l\) m i 

1 

Now, if cause (ii) were not there, that is, if all /-clusters were unique in their formation, 
then the sum S{m/} would be given by the product of the combinatorial factor (22) with 
the value of any one graph in the setup, namely 

[ (the value of an /-cluster)" 1 ', (23) 

l 

further corrected for the fact that any two arrangements that differ merely in the exchange 
of all the particles in one cluster with all the particles in another cluster of the same size, 
must not be counted as distinct, the corresponding correction factor being 

Y[a/m,\). (24) 

i 

A little reflection now shows that cause (ii) is completely and correctly taken care of if we 
replace the product of the expressions (23) and (24) by the expression 3 

11 [(the sum of the values of all possible /-clusters)" 1 '//?//!] (25) 

/ 

which, with the help of equation (16), may be written as 

Y[ [(b//!A 3f/ ~ 1) V) m Vm / !]. (26) 

/ L 

3 To appreciate the logic behind this replacement, consider expression [ ] in (25) as a multinomial expansion and 
interpret the various terms of this expansion in terms of the variety of the /-clusters. 
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The sum S{m/} is now given by the product of factor (22) and expression (26). Substituting 
this result into (21), we obtain for the configuration integral 


Z N (V, T) = N\\ 3N 



(27) 


Here, use has been made of the fact that 

Y\(X 3l ) m i = x 3j: ‘ lm ‘ = X 3N ; (28) 

/ 

see the restrictive condition (20) . The partition function of the system now follows from 
equations (3) and (27), with the result 


Qn(V,T) = 



(29) 


The evaluation of the primed sum in (29) is complicated by the restrictive condi¬ 
tion (20), which must be obeyed by every set {m/}. We, therefore, move over to the grand 
partition function of the system: 


OO 

&(z, V,T)=J2 z n Qn(V, T). 

N= 0 


(30) 


Writing 


z iV =z L l lm l = Y\lz l ) m i, (31) 

l 

substituting for Qn(V , T) from (29), and noting that a restricted summation over sets {m;|, 
subject to the condition J2i = N, followed by a summation over all values of N (from 
N — 0 to N — oo) is equivalent to an unrestricted summation over all possible sets {m/}, we 
obtain 


Q(z,V,T)= J2 

mi,rri2,...=0 


t V\ mi 1 


=n 

i=i 

oo 

=n 

i=i 


n|K) 


mil 


mi =0 


exp [biz 1 ^ 


= exp 


^ i/ 

i=i 


(32) 
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and, hence, 



(33) 


In the limit V -»■ oo, 



i=i 


(34) 


and 



(35) 


Equations (34) and (35) constitute the famous cluster expansions of the Mayer-Ursell for¬ 
malism. Eliminating the fugacity z among these equations, we obtain the equation of state 
of the system. 


10.2 Virial expansion of the equation of state 


The approach developed in the preceding section leads to exact results only if we apply it to 
the gaseous phase alone. If we attempt to include in our study the phenomena of conden¬ 
sation, the critical point, and the liquid phase, we encounter serious difficulties relating 
to (i) the limiting procedure involved in equations (10.1.34) and (10.1.35), (ii) the conver¬ 
gence of the summations over /, and (iii) the volume dependence of the cluster integrals 
bi. We, therefore, restrict our study to the gaseous phase alone. The equation of state may 
then be written in the form 



( 1 ) 


where v (= V/N) denotes the volume per particle in the system. Expansion (1), which 
is supposed to have been obtained by eliminating z between equations (10.1.34) and 
(10.1.35), is called the virial expansion of the system and the numbers afiT) the virial coef¬ 
ficients. 4 To determine the relationship between the coefficients «/ and the cluster integrals 
bi, we invert equation (10.1.35) to obtain z as a power series in (A. 3 /if) and substitute this 

4 For various manipulations of the virial equation of state, see Kilpatrick and Ford (1969). 
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into (10.1.34). This leads to equation (1), with 


a\ = fi\ = 1 , ( 2 ) 

OO 

a 2 = -t, 2 = f (e-“«/ fcr - l) r 2 dr, (3) 

0 

OO OO 

« = 48| - 2fe = - jl. / f M 13 f 23 d\ 2 d\ 3 , (4) 

0 0 

<24 = -20g| + 18g 2 B3-3B4 = ••• , (5) 


and so on; here, use has also been made of formulae (10.1.17) to (10.1.19). We note that 
the coefficient a/ is completely determined by the quantities bi, bi,..., bp that is, by the 
sequence of configuration integrals Zj,Z 2 ,.. .,Z/; see also equations (10.4.5) to (10.4.8). 
From equation (4) we obse rve that the third virial coefficient of the gas is determined 


solely by the 3-cluster 


. This suggests that the higher-order virial coefficients may 


also be determined solely Dy special “subgroup” of the various /-clusters. This is indeed 
true, and the relevant result is that, in the limit of infinite volume, 5 


l- 1 

«/ =-(/> 2 ), 


( 6 ) 


where hi-i is the so-called irreducible cluster integral, defined as 


/J/_i =-——— x (the sum of all irreducible /-clusters); 

(/-l)!X 3 d-Dy 


(7) 


by an irreducible /-cluster we mean an “/-particle graph that is multiply-connected (in the 
sense that there are at least two entirely independent, nonintersecting paths linking each 
pair of circles in the graph).” For instance, of the four possible 3-clusters, see (10.1.15), 
only the last one is irreducible. Indeed, if we express equation (4) in terms of this particular 
cluster and make use of definition (7) for /) 2 , we do obtain for the third virial coefficient 

U3 = —~P2, ( 8 ) 


in agreement with the general result (6). 6 

The quantities hi- 1 > like hi, are dimensionless and, in the limit V —oc, approach 
finite values that are independent of the size and the shape of the container (unless the 

5 For a proof of this result, see Hill (1956, Sections 24 and 25); see also Section 10.4 of the present text. 

6 It may be mentioned here that a 2-cluster is also regarded as an irreducible cluster. Accordingly, Pi = 2bz 1 see 
equations (10.1.16) and (10.2.7). Equation (3) then gives: «2 = -bz = - i Pi, again in agreement with the general result (6). 
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latter is unduly abnormal). Moreover, the two sets of quantities are mutually related; see 
equations (10.4.27) and (10.4.29). 


10.3 Evaluation of the virial coefficients 

If a given system does not depart much from the ideal-gas behavior, its equation of state 
is given adequately by the first few virial coefficients. Now, since a\ = 1, the lowest-order 
virial coefficient that we need to consider here is a 2 , which is given by equation (10.2.3): 

OO 

a 2 = -fe = ^ J (l - e~ u ^ kr ) r 2 dr, (1) 

o 

u(r ) being the potential energy of interparticle interaction. A typical plot of the function 
u(r) was shown earlier in Figure 10.1; a typical semi-empirical formula (Lennard-Jones, 
1924) is given by 


u(r ) = 4 e 



( 2 ) 


The most significant features of an actual interparticle potential are well-simulated by the 
Lennard-Jones formula (2). For instance, the function u(r ) given by (2) exhibits a “mini¬ 
mum,” of value —e, at a distance ro(= 2 1 / 6 <t) and rises to an infinitely large (positive) value 
for r < a and to a vanishingly small (negative) value for r » a. The portion to the left of the 
“minimum” is dominated by repulsive interaction that comes into play when two particles 
come too close to one another, while the portion to the right is dominated by attractive 
interaction that operates between particles when they are separated by a respectable dis¬ 
tance. For most practical purposes, the precise form of the repulsive part of the potential 
is not very important; it may as well be replaced by the crude approximation 


u(r) = +00 (for r < ro), 


(3) 


which amounts to attributing an impenetrable core, of diameter ro, to each particle. The 
precise form of the attractive part is, however, important; in view of the fact that there 
exists good theoretical basis for the sixth-power attractive potential (see Problem 3.36), 
this part may simply be written as 


u(r) = —M 0 (r 0 /r) 6 (r > r 0 ). (4) 

The potential given by expressions (3) and (4) may, therefore, be used if one is only inter¬ 
ested in a qualitative assessment of the situation and not in a quantitative comparison 
between the theory and experiment. 
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Substituting (3) and (4) into (1), we obtain for the second virial coefficient 


OO 



y’r 2 dr+/[l-expjg(^) b j]r 2 dr 


(5) 


0 


The first integral is straightforward; the second one is considerably simplified if we 
assume that (uo/kT) 1, which makes the integrand very nearly equal to —(uo/kT)(ro/r) 6 . 
Equation (5) then gives 



( 6 ) 


Substituting (6) into the expansion (10.2.1), we obtain a first-order improvement on the 
ideal-gas law, namely 



(7a) 



(7b) 


The coefficient B 2 , which is also sometimes referred to as the second virial coefficient of 
the system, is given by 



( 8 ) 


In our derivation it was explicitly assumed that (i) the potential function u(r) is given 
by the simplified expressions (3) and (4), and (ii) (uo/kT) < 1. We cannot, therefore, expect 
formula (8) to be a faithful representation of the second virial coefficient of a real gas. Nev¬ 
ertheless, it does correspond, almost exactly, to the van der Waals approximation to the 
equation of state of a real gas. This can be seen by rewriting (7a) in the form 



which readily leads to the van der Waals equation of state 



(9) 


where 


a = 




= 4c 0 . 


3 


and h = 


3 


( 10 ) 
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FIGURE 10.2 A dimensionless plot showing the temperature dependence of the second virial coefficient of several 
gases (after Hirschfelder et at, 1954). 


We note that the parameter b in the van der Waals equation of state is exactly four times 
the actual molecular volume v 0 , the latter being the “volume of a sphere of diameter ro”; 
compare with Problem 1.4. We also note that in this derivation we have assumed that 
b<^v, which means that the gas is sufficiently dilute for the mean interparticle distance to 
be much larger than the effective range of the interparticle interaction. Finally, we observe 
that, according to this simple-minded calculation, the van der Waals parameters a and b 
are temperature-independent, which in reality is not true. 

A realistic study of the second virial coefficient requires the use of a realistic potential, 
such as the one given by Lennard-Jones, for evaluating the integral in (1). This has indeed 
been done and the results obtained are shown in Figure 10.2, where the reduced coefficient 
B' 2 (— B 2 /Tq) is plotted against the reduced temperature T'(= kT /e): 

OO 

B' 2 (T') = 2ji J (l - r' 2 dr', (11) 

o 


with 


u'(r') = 



( 12 ) 


r’ being equal to (r/ro); expressed in this form, the quantity is a universal function of 
V. Included in the plot are experimental results for several gases. We note that in most 
cases the agreement is reasonably good; this is especially satisfying in view of the fact 
that in each case we had only two adjustable parameters, ro and e, against a much larger 
number of experimental points available. In the first place, this agreement vindicates 
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the adequacy of the Lennard-Jones potential for providing an analytical description of a 
typical interparticle potential. Secondly, it enables one to derive empirical values of the 
respective parameters of the potential; for instance, one obtains for argon: ro = 3.82 A 
and e/k= 120 K . 7 One cannot fail to observe that the lighter gases, hydrogen and helium, 
constitute exceptions to the rather general rule of agreement between the theory and 
experiment. The reason for this lies in the fact that in the case of these gases quantum- 
mechanical effects assume considerable importance — more so at low temperatures. To 
substantiate this point, we have included in Figure 10.2 theoretical curves for H 2 and He 
taking into account the quantum-mechanical effects as well; as a result, we find once again 
a fairly good agreement between the theory and experiment. 

As regards higher-order virial coefficients (Z > 2), we confine our discussion to a gas of 
hard spheres with diameter D. We then have 


and, hence, 



if r > D, 
if r<D, 



if r > D, 
if r<D. 


(13) 


(14) 


The second virial coefficient of the gas is then given by 


2?rD 3 

a2 = ^ 



(15) 


compare with equation ( 6 ). The third virial coefficient can be determined with the help of 
equation (10.2.4), namely 


“ s = "3l ef (16 > 

0 0 

To evaluate this integral, we first fix the positions of particles 1 and 2 (such that ri 2 < D) 
and let particle 3 take all possible positions so that we can effect an integration over the 
variable / 13 ; see Figure 10.3. Since our integrand is equal to — 1 when each of the distances 
ri 3 and r 23 (like ri 2 ) is less than D and 0 otherwise, we have 

D 

fl3 = 3l® / {/ rf3ri3 } rf3,-1 2’ (17) 

02=0 

where the primed integration arises from particle 3 taking all possible positions of interest. 
In view of the conditions n 3 < D and r 23 < D, this integral is precisely equal to the “volume 


7 Corresponding values for various other gases have been summarized in Hill (1960, p. 484). 
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/ dy \ 



y \ 



4-D r-j2~N^—^ 12 — r-\ 2~k 



1 

2 


FIGURE 10.4 




common to the spheres Si and S 2 , each of radius D, centered at the fixed points 1 and 
2”; see Figure 10.4. This in turn can be obtained by calculating the “volume swept by the 
shaded area in the figure on going through a complete revolution about the line of centers.’’ 
One gets: 


VID 2 -^^) 2 ] 

J d 3 r 13 = J | 2 (D 2 — y 2 ) 1/2 — /12 j 2nydy. (18) 

0 

While the quantity within the curly brackets denotes the length of the strip shown in the 
figure, the element of area 2 itydy arises from the revolution; the limits of integration for 
y can be checked rather easily. The evaluation of the integral (18) is straightforward; 
we get 


I'^-T K 


3D 2 r 12 


+ X 

16 


(19) 


Substituting (19) into (17) and carrying out integration over ri 2 , we finally obtain 


a 3 = 


57r z D 


2 n 6 


— Cl'). 


m 6 8 


( 20 ) 
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The fourth virial coefficient of the hard-sphere gas has also been evaluated exactly. It is 
given by (Boltzmann, 1899; Majumdar, 1929) 8 

f 1283 3 73V(2) + 1377{tan-V(2)-^/4}) 3 

[ 8960 2 1120tt J 2 

= 0.28695^2- (21) 

The fifth and sixth virial coefficients of this system have been computed numerically, with 
the results (Ree and Hoover, 1964) 


a 5 = (0.1103 ±0.003)a|, (22) 

and 

a 6 = (0.0386 ± 0.004)af. (23) 

Ree and Hoover’s estimate of the seventh virial coefficient is 0.0127a 8 . Terms up through 
10th order have been determined numerically; see Hansen and McDonald (1986) and 
Malijevsky and Kolafa (2008). If the virial equation of state for hard spheres is written in 
terms of the volume packing fraction = tt n.D ''/ 6 , the first ten terms are 

p 

—— = 1 + 4 rj + 10 ^ 2 + 18.364768;; 3 + 28.22445?y 4 + 39.81545)7 5 
nkT 

+ 53.3418;; 6 + 68 . 534 t ; 7 + 85.805!? 8 + 105. 8 ( 7 ® + • • ■ . (24) 

Carnahan and Starling (1969) proposed a simple form for the equation of state that 
closely approximates all of the known virial coefficients: 

P 1 + r) + )j 2 — 7j 3 
nkT (1 — u ) 3 

= 1 + 4?7 + 10?? 2 + 18 ? 7 3 + 28 ? 7 4 + 40 ? j 5 
+ 54 i 7 6 + 70 ? 7 7 + 88 ? 7 8 + 108 ? 7 9 + 130? 7 10 + • ■ ■ ( 25 ) 

This gives an excellent fit to the hard sphere equation of state for the entire fluid phase 
as determined in computer simulations. The fluid phase is the equilibrium phase for 
0 < rj < 0.491. The high-density equilibrium phase of hard spheres is a face-centered cubic 
solid; see Chapter 16. Many other approximate analytical forms have also been proposed 
to closely reproduce the virial series; see for instance Mulero et al. (2008). 


! See also Katsura (1959). 
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10.4 General remarks on cluster expansions 

Shortly after the pioneering work of Mayer and his collaborators, Kahn and Uhlenbeck 
(1938) initiated the development of a similar treatment for quantum-mechanical systems. 
Of course, their treatment applied to the limiting case of classical systems as well but it 
faced certain inherent difficulties of analysis, some of which were later removed by the 
formal methods developed by Lee and Yang (1959a,b; 1960a,b,c). We propose to examine 
these developments in the next three sections of this chapter. First, however, we would like 
to make a few general observations on the problem of cluster expansions. These obser¬ 
vations, due primarily to Ono (1951) and Kilpatrick (1953), are of considerable interest 
insofar as they hold for a very large class of physical systems. For instance, the system 
may be quantum-mechanical or classical, it may be a multicomponent one or single¬ 
component, its molecules may be polyatomic or monatomic, and so on. All we have to 
assume is that (i) the system is gaseous in state and (ii) its partition functions Qn(V, T), 
for some low values of N, can somehow be obtained. We can then calculate the “cluster 
integrals” bi, and the virial coefficients a;, of the system in the following straightforward 
manner. 

Quite generally, the grand partition function of the system can be written as 


&(z, V,T)=Y, Qn(V, T)z n = Y, Zn( ^ T) ( 


N= 0 


:j) 


N 


( 1 ) 


where we have introduced the “configuration integrals” Z\;(V, T), defined in analogy with 
equation (10.1.3) of the classical treatment: 


Z N (V, T) = N\1 3N Q n (V, T). 


( 2 ) 


Dimensionally, the quantity Z/v is like (a volume)^; moreover, the quantity Zo (like Qo) is 
supposed to be identically equal to 1, while Z\{= k 3 Qi) is identically equal to V. We then 
have, in the limit V -»■ oo, 


P 

kT 





1 OO 

= pE0' z ' sa y- 


i=i 


(3) 

(4) 


Again, the last expression has been written in analogy with the classical expansion 
(10.1.34); the coefficients hi may, therefore, be looked upon as the cluster integrals of the 
given system. Expanding (3) as a power series in z and equating respective coefficients with 
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the bi of (4), we obtain 


w 

II 

'nI ► 

Nl 

III 

h-< 

(5) 

_ 2!X 3 V (Zz Zl) ’ 

(6) 

&3 - 31 3Z 2 Zi+2Z 1 ), 

(7) 

S 4 = ^9^ (Z 4 - 4Z 3 Zi - 3zf + 12Z 2 Zf - 6Zf), 

(8) 


and so on. We note that, for all l > 1, the sum of the coefficients appearing within the 
parentheses is identically equal to zero. Consequently, in the case of an ideal classical gas, 
for which Z/ = V‘, see equation (10.1.4), all cluster integrals with l > 1 vanish. This, in turn, 
implies the vanishing of all the virial coefficients of the gas (except, of course, ai, which is 
identically equal to unity). 

Comparing equations (6) through (8) with equation (10.1.16), we find that the expres¬ 
sions involving the products of the various Z ; - that appear within the parentheses play the 
same role here as “the sum of all possible /-clusters” does in the classical case. We there¬ 
fore expect that, in the limit V —>■ oo, the h/ here would also be independent of the size and 
the shape of the container (unless the latter is unduly abnormal). This, in turn, requires 
that the various combinations of the Z, appearing within the parentheses here must all be 
proportional to the first power of V. This observation leads to the very interesting result, 
first noticed by Rushbrooke, namely 

t>i = — | x (the coefficient of V 1 in the volume expansion of Z/). (9) 

At this stage, it seems worthwhile to point out that the expressions appearing within 
the parentheses of equations (6) through (8) are well-known in mathematical statistics as 
the semi-invariants of Thiele. The general formula for these expressions is 

(...)»= g/{^ 3a_1 M 

l m ii 

where the primed summation goes over all sets { rn ,-} that conform to the condition 

i 

Y_ inij = /; 
i=i 



m; = 0 , 1 , 2 ,_ 


( 11 ) 
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Relations inverse to (10) can be written down by referring to equation (10.1.29) of the 
classical treatment; thus 

w \ | 

Z m = M\X 3M Q m = M\X 3M 2^ 11 - - - , (12) 

m >■ 1 

where the primed summation goes over all sets {m/} that conform to the condition 

M 

= m l = 0,1,2.... (13) 

1=1 

The calculation of the virial coefficients fl; now consists of a straightforward step that 
involves a use of formulae (5) through (8) in conjunction with formulae (10.2.2) through 
(10.2.5) . It appears, however, of interest to demonstrate here the manner in which the gen¬ 
eral relationship (10.2.6) between the virial coefficients a.[ and the “irreducible cluster inte¬ 
grals’’ Pi_i arises mathematically. As a bonus, we will acquire yet another interpretation of 
the p k . 

Now, in view of the relations 

P ( \ \ 1 °° 

if-a (14) 

v 7 i=i 


and 


1 . /z9lnQ\ 

- = Lim —- 

V V—>oo \ V dZ / 


, oo 


l=i 


we can write 

P(z) C 1 dz 

kT J v(z) z ' 
o 

We introduce a new variable x, defined by 

x = nX 3 = X 3 /v. 

In terms of this variable, equation (15) becomes 

OO 

x(z) = J^lfpz 1 , 
1=1 


(15) 


(16) 


(17) 


(18) 


the inverse of which may be written (see Mayer and Harrison, 1938; Harrison and Mayer, 
1938; also Kahn, 1938) as 


z(jc) = xexp{— 4>(x)}. 


( 19 ) 
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In view of the fact that, for z <£ 1, the variables z and x are practically the same, the function 
(p (x) must tend to zero as x ->• 0 ; it may, therefore, be expressed as a power series in x: 

OO 

c/>(x) = J2^ xk - ( 20 ) 

k= 1 


It may be mentioned beforehand that the coefficients fa of this expansion are ulti¬ 
mately going to be identified with the “irreducible cluster integrals” yS/_i. Substituting from 
equations (17), (19), and (20) into equation (16), we get 



Combining (17) and (21), we obtain 



( 21 ) 


( 22 ) 


Comparing this result with the virial expansion (10.2.1), we arrive at the desired relation¬ 
ship: 


ai = - ^ P[-i (/> 1 ). ( 23 ) 

For obvious reasons, the fa appearing here may be regarded as a generalization of the 
irreducible cluster integrals of Mayer. 

Finally, we would like to derive a relationship between the fa and the bi- For this, we 
make use of a theorem due to Lagrange which, for the present purpose, states that “the 
solution x(z) to the equation 


z(x) = x/f (x) 


( 24 ) 


is given by the series 


OO / 

x—\ z> 

x(2) = E j 

7=1 


a 


/—i 


dy 


pri /®} 7 


Jt=o 


( 25 ) 


it is obvious that the expression within the square brackets is (7 — 1 )! times “the coefficient 
of in the Taylor expansion of the function about the point £ = 0.” Applying this 
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theorem to the function 


f(x) = explflHx)} = exp 



OO 

n ex P(Pk xic )’ 

k= 1 


(26) 


we obtain 


t) 


x(z) = J2j ( j~ l)l 




the coefficient off / 1 in the Taylor expansion 


of Y\ exp(J ; p k % k ) about £ = 0 1. 
k= 1 


Comparing this with equation (18), we get 
1 


Oj = X 
J ] 2 


the coefficient of ^ 1 in J" ] 

k= 1 

, 7~1 




m k-Q 


m k l 


= 4 E'n^?. (27) 

/ (»T)^ "h! 

where the primed summation goes over all sets { m G that conform to the condition 

Y_ knrfc =j — 1; mfc = 0,1,2,_ (28) 

fc=i 

Formula (27) was first obtained by Maria Goeppert-Mayer in 1937. Its inverse, however, 
was established much later (Mayer et al., 1942; Kilpatrick, 1953): 


{m,J (/ -1)! 11 




(29) 


where the primed summation goes over all sets {m,} that conform to the condition 

i 

yp'- l)rrii = l- 1; m,- = 0,l,2,.... (30) 

i=2 

It is not difficult to see that the highest value of the index i in the set {m,-} would be l 
(the corresponding set having all its m,; equal to 0, except ni[ which would be equal to 1); 
accordingly, the highest order to which the quantities bi would appear in the expression 
for is that of bi■ We thus see, once again, that the virial coefficient fl/ is completely 
determined by the quantities b\, fe,..., &/. 
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10.5 Exact treatment of the second 
virial coefficient 

We now present a formulation, originally from Uhlenbeck and Beth (1936) and Beth and 
Uhlenbeck (1937), that enables us to make an exact calculation of the second virial coeffi¬ 
cient of a quantum-mechanical system from a knowledge of the two-body potential u(r). 9 
In view of equation (10.4.6), 


S2= - az= ^v( Z2 ~ z 0- 


2X 3 V 

For the corresponding noninteracting system, one would have 


gf = - a r = 


( 0 ) 


= J-( 

n?v v 


(z‘ o) ~z[ o)2 y, 


( 1 ) 


( 2 ) 


the superscript (0) on the various symbols here implies that they pertain to the noninteract¬ 
ing system. Combining (1) and (2), and remembering that Z\ = Zj 0) = V, we obtain 




(3) 


which, by virtue of relation (10.4.2), becomes 


te-6f-y(<j2-Qf)-^Tr( 


= - Tr e-P” 2 - e-P'C 


( 0 ) 


(4) 


For evaluating the trace in (4), we need to know the eigenvalues of the two-body 
Hamiltonian which, in turn, requires solving the Schrodinger equation 10 


H2V a (ri,r 2 )=E a 'li a (ri,r2), 


(5) 


where 


h 2 = _^( v; + v|) 

Transforming to the center-of-mass coordinates R 
nates r{— (r 2 — it )}, we have 


+ u(ri 2 ). (6) 

| = | (n + it) | and the relative coordi- 


^ a (R,r) = iljj(R)fn(r) = 


1 

PV 2 


e i(Pj-R)/h 


tyn(r). 


(7) 


with 


F — 


±_ 

2(2 m) 


+ B n - 


(8) 


9 For a discussion of the third virial coefficient, see Pais and Uhlenbeck (1959). 

10 For simplicity, we assume the particles to be “spinless.” For the influence of spin, see Problems 10.11 and 10.12. 
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Here, P denotes the total momentum of the two particles and 2m their total mass, while s 
denotes the energy associated with the relative motion of the particles; the symbol a refers 
to the set of quantum numbers j and n that determine the actual values of the variables P 
and £. The wave equation for the relative motion will be 


h 2 


2 (2 m ) 


V 2 + u(r ) 1 f n (r) = e n f n (r), 


0 ) 


I m being the reduced mass of the particles; the normalization condition for the relative 
wavefunction will be 


J\xfr n (r)\ 2 d 3 r= 1. 

Equation (4) thus becomes 

fc- 62 0) = yE( e '^“- e "^ r ] 

= — Y^e~ pP i /im Y^ {e _/ten - e-^ 

v j n l 


( 10 ) 


( 11 ) 


For the first sum, we obtain 


X^ e -PPj/ 4m _ ^X. J e -f>P 2 /4m p 2 d p = 8V2 V 

4 " h 3 J X 3 ’ 

J 0 


so that equation (11) becomes 

g 2 _g(0) = 8 1 /2^j e -^- e -M 0) 

n 


( 12 ) 


(13) 


The next step consists of examining the energy spectra, e n and 4, 0 ) , of the two systems. 
In the case of a noninteracting system, all we have is a “continuum” 


e 


( 0 ) _ 
n ~ 



h 2 k 2 


m 


C k = p/h ), 


(14) 


with the standard density of states g (0> (k). In the case of an interacting system, we may 
have a set of discrete eigenvalues eb (that correspond to “bound” states), along with a 
“continuum” 


h 2 k 2 


En = 


m 


(k — p/h), 


(15) 
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with a characteristic density of states g(k). Consequently, equation (13) can be written as 

OO 

p2- = 8 1/2 J2 e ~ fiSB + 81/2 f e^ fi2fc2/m {g(fc) - g <0) (k)}dk, (16) 

B o 

where the summation in the first part goes over all bound states made possible by the 
two-body interaction. 

The next thing to consider here is the density of states g(k). For this, we note that, since 
the two-body potential is assumed to be central, the wavefunction f n (r) for the relative 
motion may be written as a product of a radial function y(r) and a spherical harmonic 
Y(6,<p): 

i r klm ( r ) = A klm Y lm (6,<p). (17) 

Moreover, the requirement of symmetry, namely i /r(— r) = fir) for bosons and \!/(—r) = 
—i j/{r) for fermions, imposes the restriction that the quantum number l be even for bosons 
and odd for fermions. The (outer) boundary condition on the wavefunction may be written 
as 


Xkl(Ro) = 0 , 


(18) 


where R 0 is a fairly large value (of the variable r) that ultimately goes to infinity. Now, the 
asymptotic form of the function Xki( r ) is well-known: 

/ fc ;(r)cxsin|fcr-y+ f?/(fc)|; (19) 


accordingly, we must have 

kRo — y + rn(k) = nn, n= 0,1,2,_ (20) 

The symbol in(k) here stands for the scattering phase shift due to the two-body potential 
u(r ) for the Zth partial wave of wave number k. 

Equation (20) determines the full spectrum of the partial waves. To obtain from it an 
expression for the density of states gi(k), we observe that the wave number difference A k 
between two consecutive states n and n + 1 is given by the formula 


*° + ^n 


A k = n, 


( 21 ) 


with the result that 


gl(k) = 


21+1 
A k 


21+1 


Ro + 


drn(k) | 


jr 


dk 


( 22 ) 
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the factor (2/ + 1) has been included here to take account of the fact that each eigenvalue 
k pertaining to an Zth partial wave is (2 1 + l)-fold degenerate (because the magnetic quan¬ 
tum number m can take any of the values Z, (l - 1),..., -Z, without affecting the eigenvalue). 
The total density of states, g(k), of all partial waves of wave numbers around the value k is 
then given by 


g(k) = J2 gi(k) = -j-E (2Z+1) + (23) 

note that the primed summation J2' goes over / = 0,2,4,... in the case of bosons and over 
l— 1,3,5,... in the case of fermions. For the corresponding noninteracting case, we have 
(since all ;p(k) = 0) 


g (0) (k) = —E(2Z+1). 

^ i 


(24) 


Combining (23) and (24), we obtain 


g(k)-g {0 Hk)=-J2(2l+l) 


drnik) 

dk 


(25) 


Substituting (25) into (16), we obtain the desired result 

8 1 / 2 


02 — 


gf = 8!/2yy/fca + —E'(2Z+ 1) [e-^ k2 > md ^clk (26) 

„ X 1 J 3 k 


which, in principle, is calculable for any given potential u(r) through the respective phase 
shifts m(k). 

Equation (26) can be used for determining the quantity t >2 - fh ' ■ To determine Zfc itself, 
we must know the value of . This has already been obtained in Section 7.1 for bosons 
and in Section 8.1 for fermions; see equations (7.1.13) and (8.1.17). Thus 


K (0) 


= -a ( n 0) = ± 


25/2 ' 


(27) 


where the upper sign holds for bosons and the lower sign for fermions. It is worthwhile to 
note that the foregoing result can be obtained directly from the relationship 


fi (0)_ 1 / 7 (0) 7 (0)2\ _ ( n (0) 1 n (0)2\ 
2 “ 2X 3 V\ 2 1 )~ V V^ 2 2^! ) 
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by substituting for Q ( 2 01 the exact expression (5.5.25): 




A 2 

V 




1 

2 



2 " 



(28) 11 


It is of interest to note that this result can also be obtained by using the classical for¬ 
mula (10.1.18) and substituting for the two-body potential u(r) the “statistical potential” 
(5.5.28); thus 


gf = ^ f (e~Us(r)/kT _ ^ r 2 dr 
0 

oo 

= ± ^/ e_2 ^ AVdr = ± i- (29) 

o 

As an illustration of this method, we now calculate the second virial coefficient of a gas 
of hard spheres. The two-body potential in this case may be written as 


u(r) = 


+oo for r < D 
0 for r > D. 


(30) 


The scattering phase shifts rn(k) can now be determined by making use of the (inner) 
boundary condition, namely x(r) = 0 for all r < D and hence it vanishes as r —a D from 
the above. We thus obtain (see, for example, Schiff, 1968) 


rii0c) = tan 1 


jlQcD) 
ni(kD) ’ 


(31) 


where j)(x) and n/(x) are, respectively, the “spherical Bessel functions” and the “spherical 
Neumann functions”: 

sinx sinx —xcosx 

Jo(X)= -, ] i(x)=-=-, 

X X A 

(3 — x 2 ) sinx — 3xcosx 

72 W=-3 


and 


cosx cosx + xsinx 

«oW =-. ni(x)= -^- 

x x z 

(3 — x 2 ) cosx + 3xsinx 
n 2 (x) =-^-,.... 


n This calculation incidentally verifies the general formula (10.4.9) for the case 1 = 2. By that formula, the “cluster 
integral” I 02 of a given system would be equal to 1/(2X 3 ) times the coefficient of V 1 in the volume expansion of the 
“configuration integral” Z 2 of the system. In the case under study, this coefficient is ±A 3 /2 3 ^ 2 ; hence the result. 
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Accordingly, 


r] 0 (k) = tan x {—tan(fcD)} = — kD, 


r/i(k) = tan 1 


tan(/cD) — kD 
1 + kDtan(kD) 


(kD) 3 | (kD) 5 
3 + 5 


— [kD— tan l (kD)} 


ri 2 (k) = tan 1 


tan(kD)— 3(kD)/[3—(kD) 2 ] | 
1 + 3 (kD) tan(7cD)/[3 - (kD) 2 ] J 


| kD — tan 1 


3 (kD) 

3 — (kD) 2 


(kD) 5 

- 1 - 

45 


(32) 


(33) 


(34) 


and so on. We now have to substitute these results into formula (26). However, before doing 
that we should point out that, in the case of hard-sphere interaction, (i) we cannot have 
bound states at all and (ii) since, for all l, rji(0 ) = 0, the integral in (26) can be simplified by 
a prior integration by parts. Thus, we have 


fe - sf 


8 V 2 A 2 


7T 


2 


D (2/+l) f e-^ h2k2/m m (k)kdk. 


(35) 


Substituting for l = 0 and 2 in the case of bosons and for Z = 1 in the case of fermions, we 
obtain (to fifth power in D/A.) 


82 - ©2 0) = -2 


D 

X 

(D 


1 IOtt 2 /D ^ 5 


= —6?r - + 187T - - 


D 


(Bose) 

(Fermi), 


(36) 

(37) 


which maybe compared with the corresponding classical result —(27r/3)(D/A) 3 . 


10.6 Cluster expansion for a quantum-mechanical 
system 

When it comes to calculating hi for / > 2 we have no formula comparable in simplicity to 
formula (10.5.26) for fe- This is due to the fact that we have no treatment of the /-body 
problem (for / > 2) that is as neat as the phase-shift analysis of the two-body problem. 
Nevertheless, a formal theory for the calculation of higher-order “cluster integrals” has 
been developed by Kahn and Uhlenbeck (1938); an elaboration by Lee and Yang (1959a,b; 
1960a,b,c) has made this theory almost as good for treating a quantum-mechanical sys¬ 
tem as Mayer’s theory has been for a classical gas. The basic approach in this theory is to 
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evolve a scheme for expressing the grand partition function of the given system in essen¬ 
tially the same way as Mayer’s cluster expansion does for a classical gas. However, because 
of the interplay of quantum-statistical effects and the effects arising from interparticle 
interactions, the mathematical structure of this theory is considerably involved. 

We consider here a quantum-mechanical system of N identical particles enclosed in a 
box of volume V. The Hamiltonian of the system is assumed to be of the form 

^=-^E v i 2 +E“< r </>- (1) 

«= 1 i<i 

Now, the partition function of the system is given by 

Q n (V, T) = Tr (e~ pkN ) = ^e~ pEa 

a 

= J2 f {^(1.• • .,N)e~^ N ... ,N)}d m r, (2) 

“ v 

where the functions Tv are supposed to form a complete set of (properly symmetrized) 
orthonormal wavefunctions of the system, while the numbers l,...,iV denote the posi¬ 
tion coordinates r\ ,..., rjv, respectively. We may as well introduce the probability density 
operator Wn of the system through the matrix elements 

(1',... ,N'\Wn\1, ... ,N) =N!A 3Jv ^{'Iv(l',...,W)e - ^ iv 'I'*(l,...,A0} 

a 

= N\X 3N J2^o,( T,.. • .. ,,N)}e ~ pEa . (3) 

a 

We denote the diagonal elements of the operator Wn by the symbols Wv(l,. ..,1V); thus 

W N ( 1, ...,N) = N\X 3N ^{Tf„(l,...,1V)T/*(1,.. .,N)}e~ pEa , (4) 

a 

whereby equation (2) takes the form 

QN(V ’ T)= N^f ^va,..,Ar)d 3W r=^Tr(Wv). (5) 

v 

A comparison of equation (5) with equations (10.1.3) and (10.4.2) shows that the “trace of 
the probability density operator Wn" is the analogue of the “configuration integral” Zy, 
and the quantity Wn( 1,... ,N)d iN r is a measure of the probability that the “configuration” 
of the given system is found to be within the interval [(n,..., r,y), (n + dr \,..., r,y + drjv)]. 

Before we proceed further, let us acquaint ourselves with some of the basic properties 
of the matrix elements (3) : 
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(i) 


(i'imilt) = A 3 



1 e Kpr' 1 )/h 1 e -i(p-ri)/h 

, JV A 


e -/!p 2 /2m 



Vd3 P Jip- (r'j-n )/h-pr?/2m) 

h 3 


= g _7r l r i _r il 2 A 2 ; 


( 6 ) 


compare with equation (5.3.14) for the density matrix of a single particle. The 
foregoing result is a manifestation of the quantum-mechanical, not 
quantum-statistical, correlation between the positions r and r' of a given particle (or, 
for that matter, any particle in the system). This correlation extends over distances 
of the order of X which is, therefore, a measure of the linear dimensions of the wave 
packet representing the particle. As T -»• oo, and hence X -> 0, the matrix element (6) 
tends to zero for all finite values of Ir^ — r \|. 

(ii) (l|Wi|l) = l; (7) 

consequently, by equation (5), 

Qi(c,D = l/mV=^. (8) 

v 

(iii) Whatever the symmetry character of the wavefunctions 4\ the diagonal elements 
W/v(l,..., AT) of the probability density operator W N are symmetric in respect of a 
permutation among the arguments (1,... ,1V). 

(iv) The elements W/v(l, ...,N) are invariant under a unitary transformation of the set 
{*«}. 

(v) Suppose that the coordinates n,..., are such that they can be divided into two 
groups, A and B, with the property that any two coordinates, say r ; - and rj, of which 
one belongs to group A and the other to group B, satisfy the conditions that 

(a) the separation r,j is much larger than the mean thermal wavelength X of the 
particles, and 

(b) it is also much larger than the effective range ro of the two-body potential, then 


W N (r lt ...,r N ) ~ W A (r A )W B (r B ), 


(9) 


where r A and r B denote collectively the coordinates in group A and group B, 
respectively. It is not easy to furnish here a rigorous mathematical proof of this 
property, though physically it is quite understandable. One can see this by noting 
that, in view of conditions (a) and (b), there does not exist any spatial correlation 
between the particles of group A on one hand and the particles of group B on the 
other (either by virtue of statistics or by virtue of interparticle interactions). The two 
groups, therefore, behave toward each other like two independent entities. It is then 
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natural that, to a very good approximation, the probability density Wn of the 
composite configuration be equal to the product of the probability densities Wa 
and We- 

We now proceed with the formulation. First of all, to fix ideas about the approach to be 
followed, we may consider the case with N = 2. In that case, as ri 2 -*■ oo, we expect, in view 
of property (v), that 


W 2 (l,2)-> Wi(l)Wi(2) = l. (10) 

In general, however, W 2 (l,2) will be different from W\ (1) Wj (2). Now, if we denote the 
difference between W 2 (l,2) and fVi(l)Wi(2) by the symbol f/ 2 (l,2), then, as ri 2 oo, 

U 2 ( 1,2)-*0. (11) 

It is not difficult to see that the quantity t / 2 (1,2) is the quantum-mechanical analogue of 
the Mayer function^-. With this in mind, we introduce a sequence of cluster functions U\ 
defined by the hierarchy 12 

(l'|W 1 |l) = (l'|t/i|l), ( 12 ) 

<l',2 , |W 2 |l,2) = (l , |t>i|l)(2 , |[>i|2) + (l',2 , |[>2|l,2), (13) 

(l',2',3'|W3|l,2,3> = <l'|&i|l)<2'|I7i|2><3'|E/i|3> 

+ (1'\U 1 \1)(2',3'\U 2 \2,3) 

+ (2'|t/i|2)(l',3'|& 2 |l,3) 

+ (3'|[>i|3)(1 , ,2 , |[> 2 |1,2) 

+ (1', 2', 3'| C/ 3 11,2,3), (14) 

and so on. A particular Uj is thus defined with the help of the first l equations of the 
hierarchy. The last equation in this hierarchy will be (writing only the diagonal elements) 

W n (1,...,N) = J2 K][[7i() - [/i()][[/ 2 () - [/ 2 ()] -- , (15) 

[ p mi factors m 2 factors J 

where the primed summation goes over all sets {m/} that conform to the condition 

N 

J2lmi = N ; m t = 0 , 1,2 . (16) 

1=1 


12 The functions Uj were first introduced by Ursell, in 1927, in order to simplify the classical configuration integral. 
Their introduction into the quantum-mechanical treatment is due to Kahn and Uhlenbeck (1938). 
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Moreover, in selecting the arguments of the various [// appearing in (15), out of the num¬ 
bers 1,..., N, one has to remember that a permutation of the arguments within the same 
bracket is not regarded as leading to anything distinctly different from what one had 
before the permutation; the symbol J2p then denotes a summation over all distinct ways 
of selecting the arguments under the set {m/}. 

Relations inverse to the preceding ones are easy to obtain. One gets 

<l'|&i|l> = (l'|Wi|l>, (17) 

<1 / , 2' | L/ 211 ,2) = ( 1 / , 2 r | W 211 ,2) — (\'\Wi\\)(2!\Wi\2), (18) 

(1', 2', 3'| O 3 11,2,3) = (l',2',3'[W3|l,2,3) 

- (1 , |W 1 |1>(2 , ,3'|W' 2 |2,3> 

-<2'|Wi|2>(1',3'|W2|1,3> 

-(3'|Wi|3)(1',2'|W 2 |1,2) 

+ 2(l , [m 1 |l)(2'|mi|2)(3'|lVi|3), (19) 


and so on; compare the right sides of these equations with the expressions appear¬ 
ing within the parentheses in equations (10.4.5) through (10.4.7). We note that (i) the 
coefficient of a general term here is 


T.m ~1 
(- 1 )' 





( 20 ) 


where J2i m l is the number of the W n in the term, and (ii) the sum of the coefficients of 
all the terms on the right side of equations (18), (19), ... is identically zero. Moreover, the 
diagonal elements Uj{ 1,...,/), just like the diagonal elements of the operators W n , are sym¬ 
metric in respect of permutations among the arguments ( 1 ,..., Z), and are determined by 
the sequence of the diagonal elements W\, W 2 ,..., W t . Finally, in view of property (v) of the 
W n , as embodied in formula (9), the Ui possess the following property: 


I7/(1,...,J)~0 if Tij » a, r 0 ; (21) 

here, tij is the separation between any two of the coordinates (1,..., Z). 13 
We now define the “cluster integral” b / by the formula 

b,(V ’ T)= n^vf cw '-"’ ,)d3ir: (22) 

compare with equation (10.1.16). Clearly, the quantity bi(V,T ) is dimensionless and, by 
virtue of property (21) of the diagonal elements [7/(1,..., Z), is practically independent of V 
(so long as V is large). In the limit V —»• 00, b/(V, T) tends to a finite volume-independent 

13 This can be seen by examining the break-up of the structure on the right side of any equation in the hierarchy 
(18,19,...) when one or more of the l coordinates in the “cluster” get sufficiently separated from the rest. 
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value, which may be denoted by bi(T). We then obtain for the partition function of the 
system, see equations (5) and (15), 


Qn(Y> T) = 


1 


N\X 3N 

1 


(d-rjE' 

J [ f m t } 

E' 


. p 

N\ 


N\X 3N j mj ) (l!) m i ( 2!) m 2 • •. m\\m 2 \- 


/■ 


x / d 3iV r{[f/i• ■ • t/i][t/ 2 ■ ■ • t/ 2 ]• ■ ■ 


(23) 


(24) 


In writing the last result we have made use of the fact that, since a permutation among 
the arguments of the functions [// does not affect the value of the integral concerned, 
the summation over P may be replaced by any one term of the summation, multiplied 
by the number of distinct permutations allowed by the set {m/}; compare with the corre¬ 
sponding product of the numbers (10.1.22) and (10.1.24). Making use of the definition (22), 
equation (24) can be written as 


Qn(V,T ) = tEE 
X™ {m/J 


N 


i=i 


=E' 

[mil 


n( 

i=i 1 


bi 


V\ mi 1 

X 3 ) mil 


(25) 


again, use has been made of the fact that 


n 


3 E lm.i 

{ x 3 l ri=\i =x 3N . 


(26) 


Equation (25) is formally identical to equation (10.1.29) of Mayer’s theory. The subsequent 
development of the formalism, leading to the equation of state of the system, is formally 
identical to that theory. Thus, we finally obtain 

p 00 oo 

fcf = ^E^ and w 

i=i i=i 


There are, however, important physical differences. We may recall that the calculation 
of the cluster integrals bi in the classical case involved the evaluation of a number of finite, 
3Z-dimensional integrals. The corresponding calculation in the quantum-mechanical case 
requires a knowledge of the functions Uj and hence of all IV,,, with n < /; this in turn 
requires solutions of the //-body Schrodinger equation for all n < l. The case l — 2 can 
be handled neatly, as was done in Section 10.5. For l > 2, the mathematical procedure 
is rather cumbersome. Nevertheless, Lee and Yang (1959a,b; 1960a,b,c) have evolved a 
scheme that enables us to calculate the higher bi in successive approximations. According 
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to that scheme, the functions U] of a given system can be evaluated by “separating out” 
the effects of statistics from those of interparticle interactions, that is, we first take care of 
the statistical aspect of the problem and then tackle the dynamical aspect of it. Thus, the 
whole feat is accomplished in two steps. 

First, the [/-functions pertaining to the given system are expressed in terms of U- 
functions pertaining to a corresponding quantum-mechanical system obeying Boltzmann 
statistics, that is, a (fictitious) system described by unsymmetrized ivavefunctions. This 
step takes care of the statistics of the given system, that is, of the symmetry properties 
of the wavefunctions describing the system. Next, the U -functions of the (fictitious) Boltz- 
mannian system are expanded, loosely speaking, in powers of a binary kernel B which is 
obtainable from a solution of the two-body problem with the given interaction. A com¬ 
mendable feature of this method is that it can be applied even if the given interaction 
contains a singular, repulsive core, that is, even if the potential energy for certain configu¬ 
rations of the system becomes infinitely large. Though the method is admirably systematic 
and fairly straightforward in principle, its application to real systems is quite complicated. 
We will, therefore, turn to a more practical method — the method of quantized fields (see 
Chapter 11) — which has been extremely useful in the study of quantum-mechanical sys¬ 
tems composed of interacting particles. For a detailed exposition of the (binary collision) 
method of Lee and Yang, see Sections 9.7 and 9.8 of the first edition of this book. 

In passing, we note yet another important difference between the quantum- 
mechanical case and the classical one. In the latter case, if interparticle interactions are 
absent, then all bn with l> 2, vanish. This is not true in the quantum-mechanical case; 
here, see Sections 7.1 and 8.1, 


g[ 0) = (±i ) l ~ l r 5/2 , 

of which equation (10.5.27) was a special case. 


( 28 ) 


10.7 Correlations and scattering 

Correlations and scattering play an extremely important role in modern statistical 
mechanics. Different phases most are easily distinguished by different spatial orderings 
they display. Molecules in a low-density vapor are nearly uncorrelated whereas molecules 
in a dense liquid can be strongly correlated and display short-range order due to their 
strong steric repulsions but the correlations decay away rapidly at large distances. In crys¬ 
talline solids, the location of every particle is highly correlated with the location of all the 
others, and these correlations do not decay away to zero at large distances between the 
particles; this is called long-range order. At a critical point, systems display order that lies 
between short-range and long-range, with so-called quasi-long-range order characterized 
by a power-law decay of correlations. Crystals and liquid-crystal phases display molecu¬ 
lar orientational correlations that can be short-range, long-range, or quasi-long-range in 
addition to the various spatial orderings of the molecules. Different phases of magnets 
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are distinguished by the spatial orderings of the magnetic dipoles: short-range ordering in 
paramagnets, long-range ordering in ferromagnets and antiferromagnets, and power-law 
decay of correlations at magnetic critical points. 

Spatial correlation functions are based on //-particle densities. The one-body number 
density is defined by the average quantity 


n\(r) = 



( 1 ) 


This defines the local number density in which n\(r)dr is a measure of the probabil¬ 
ity of finding a particle inside an infinitesimal volume dr located at position r. If the 
system is translationally invariant, the one-body density is the usual number density 
n\ (r) — n = (N)/V. The spatial integral of the one-body density over volume V gives the 
average number of particles in that volume: 

J ti\{r)dr = (N ). (2) 

The two-body number density is defined as 


n 2 (r,r') = (^5(r-r,)«(r' - rfi). (3) 

' ¥j I 

The quantity n 2 (r, rfdrdr' is a measure of the probability of finding one particle inside the 
infinitesimal volume dr located at position r and another particle inside the infinitesimal 
volume dr’ located at position r'. In a dilute classical gas, the particles interact only when 
they are close to one another, so the probability of finding two different particles at two 
different locations many atomic diameters apart is simply the product of finding either 
particle individually, that is, ^(r.r') -> m(r)ni(r') as |r — r'| -* oo. It is the deviation from 
this uncorrelated behavior that is both interesting and important. The integral of the two- 
body density over volume V gives 

J n 2 (r, r’)drdr' = |iV 2 | - ( N). (4) 

If the system is translationally and rotationally invariant, the one-body number density is 
independent of position and the two-body number density depends only on the magni¬ 
tude of the distance between r and r'. This allows us to define the pair correlation function 

g(rf 


n 2 (r,r') = n 2 g(\r-r'\). 


(5) 
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FIGURE 10.5 An approximate pair correlation function for hard spheres with diameter D in three dimensions. The 
volume fraction i; = jrnLfi /6 ~ 0.49 is the fraction of the volume occupied by the particles and is close to the liquid 
side of the solid-liquid phase transition in the model. The correlation function is calculated using the exact solution 
of the Percus-Yevick approximation; see Percus and Yevick (1958), Wertheim (1963), and Hansen and McDonald 
(1986). The correlation length for this case is £ 2D. 


In three dimensions, Anrir 2 g(r) dr is the probability of finding a particle in a spherical 
shell of radius r and thickness dr, given that another particle is simultaneously located 
at the origin. The pair correlation function of a classical ideal gas is equal to unity; see the 
footnote to Problem 10.17. 

Figure 10.5 displays the pair correlation function g(r) for a system of hard spheres 
interacting via pair potential 


u(r) = 



if r > D, 
if r<L>. 


( 6 ) 


Clearly, the pair correlation function vanishes for r < D since no two particles in the system 
can be closer to each other than D due to the infinite repulsion. These steric repulsions 
result in an oscillatory decay of g(r). The pair correlation function is greater than unity 
at separations slightly greater than D since the local geometry of the fluid enhances the 
probability of finding two particles a distance slightly more than D apart; for illustration, 
see Figure 10.6. The pair correlation function is less than unity at slightly larger distances 
due to the repulsion of the cluster of particles just outside the hard repulsion distance. 
The oscillating correlations decay rapidly with distance, so that g(r) approaches unity at 
large separations. This behavior of the pair correlation function is typical of all dense fluids 









334 Chapter 10 • Statistical Mechanics of Interacting Systems 



FIGURE 10.6 An equilibrium configuration of hard disks that displays steric effects leading to oscillations in the pair 
correlation function. The inner dashed circle with radius D is the closest approach distance to the central disk. In 
this case, the centers of five disks are close to the distance D which contributes to the enhancement in g(r) near 
r = D. The outer dashed circle shows the next shell of particles that contribute to the second peak in g(r). 
In-between these distances, we have a reduced probability of finding the center of a particle, leading to g(r) < 1. 


and is called short-range order since the correlations decay exponentially with distance: 
g(r) — 1 ~ exp (—r/f), where £ is called the correlation length. 

The pair correlation function can be used to directly calculate the pressure in a fluid. 
For a classical fluid whose potential energy can be written as a sum of pair potentials, 

U N {ri,r 2 ,...,r N ) = J2 u ( r ij), (7) 

i<j 

the pressure is determined by the average of the quantity r(du/dr) between pairs of par¬ 
ticles, as discussed in Section 3.7. In the canonical ensemble, the pressure P is given by 

3 A \ kT ( dZpj\ 
dV/ TN Z N \ 3 V ) TN 

where Z,y is the configurational partition function 

z n = ^J rf,V r exp ^-/f J2 u(/ij)\ 

The d- dimensional integrals over the volume V can be rewritten in terms of a set of scaled 
variables {$,■} defined by r,- — so the scaled integrals are over regions with unit 



( 8 ) 


(9) 







10.7 Correlations and scattering 335 


volume: 

Zn=v N \f dNs exp T- 1 . u ( v ' /ds ‘j >\■ (10) 

Equations (8) and (10) then give 

P=nkT { 1 -2hffr ,gmdr )- 011 

This is called the virial equation of state and is useful for determining pressure from 
approximate expressions for the pair correlation function. Compare equation (11) with 
the form of the virial equation of state in equation (3.7.15). 

For the particular case of hard spheres, the discontinuous potential results in the pres¬ 
sure being determined by the pair correlation function at contact. In one, two, and three 
dimensions, the hard sphere pressure is given by 

l + rig(D + ) r] = nD d— 1, 

1 +2rig(D + ) r,= \nD 2 d= 2, (12) 

l + 4rig(D + ) r] = ^nE) i d = 3, 

where g(D + ) is the correlation function at contact and i] is the volume fraction, that is, the 
fraction of the d-dimensional volume of the sample occupied by the spheres; see Problem 
10.14. Likewise, the internal energy of the fluid can be written as an integral over the pair 
correlation function and the pair potential: 

U(N, V, T) = (H) = d ^L + n ^- J u{r)g(r)dr. (13) 

The pair correlation function itself contains all the statistical information needed to 
construct the full thermodynamic behavior of the system. For example, equation (4) can 
be used to show that the isothermal compressibility, which is proportional to the number 
density fluctuations, is also proportional to an integral over the pair correlation function: 

K T f lN 2 )-{N) 2 

nkTKT = is = 1 + «_/ - 1 )dr = -; (14) 

this is known as the compressibility equation of state. Since Kf 1 — j , one can use 
equation (14) to determine the pressure and free energy of the system by performing 
thermodynamic integrations with respect to the particle density. 

10.7.A Static structure factor 

The pair correlation function g(r) can be measured experimentally using quasielastic 
scattering. If a sample is illuminated with a monochromatic beam of x-rays, neutrons, vis¬ 
ible light, and so on, the scattered intensity as a function of the angle from the incident 


pHS 

nkT 
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beam direction is proportional to the Fourier transform of g(r). The quasielastic scattering 
amplitude from a single particle at location r, illuminated by a plane wave with amplitude 
0o and wavevector Icq into a detector at location R is 

e ik(yr ie iky(R-ri) 

d>i(fc) = 0 o /(fc)-----, (15) 

l-K- f/| 

where 1c = k\ — Icq is the wavevector transfer and f(1c) is the single-particle scattering 
form factor; see Figure 10.7. The total scattering amplitude from the N particles in the 
sample is 


(16) 

where we have assumed that the detector is far from the sample. The scattered intensity 
from the IV-particle sample is 

I N (k) = ^(fc)! 2 « - Nh(k)S(k), (17) 



FIGURE 10.7 Scattering from two particles. The incident wavevector is kg, the scattered wavevector toward the 
detector is fci, and the wavevector transfer is k= fci - fco. Since |fci| = |fc 0 j for quasielastic scattering, the magnitude 
of the wavevector transfer is fc= 2fcosin(@/2), where 0 is the angle between fco and fci. 
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where h (k) is the scattering intensity from a single particle and 

S(k) = ^[j2 cx P(- ik - (r i ~ rj))j (18) 

is the static structure factor. It represents the actual scattering intensity divided by the 
scattering intensity from an imaginary randomly distributed and, therefore, uncorrelated 
sample of atoms at the same particle density n. 

If the sample is translationally invariant and isotropic, as in a uniform fluid, the 
static structure factor depends only on the magnitude of the wavevector transfer, that is 
S(k) — S(k). For that case, S(k) can be written as the Fourier transform of the pair correla¬ 
tion function: 


S(k) = 1 + ^ j (g(r) - l)e‘ 


ikr dr+ ^ |/ e* 


ikr 


dr 


(19) 


The final term in equation (19) represents the forward shape scattering of the sample 
volume. The shape scattering term is negligible for k » l/L, so in the thermodynamic limit 
it can be ignored for k f 0. The structure factor for isotropic fluids in one, two, and three 
dimensions is then given by 


OO 


S(Jfc) = 1 + 2 n j ( 
0 

c 

'g(r) — l)cos (kr)dr 

O 

d= 1, 

(20a) 

S(k) = \ + 2nn J 
0 

r(g(r) - l)J 0 (kr)dr 

OO 

d = 2, 

(20b) 

„ , 4jr n 

S{k ) = 1 + -p 

f r(g(f) — l)sin(fcr)dr 

d= 3 . 

(20c) 


0 


The pair correlation function g(r ) can be determined using the inverse Fourier trans¬ 
form of the measured structure factor, as shown in Figure 10.8. For liquids and other 
short-range ordered materials, the structure factor tends to unity as k -* oo. The value of 
S(k) as k -y 0 is a measure of the number density fluctuations in the sample: 

f KT (N 2 )-(N) 2 

lim S(k) = 1 + n / (g(r) — 1 )dr = — T = -— -. (21) 

k-*o J 6 4 deal (N) 

Equation (21) is called the fluctuation-compressibility relation and is the equilibrium limit 
of the fluctuation-dissipation theorem we will discuss in Section 15.6. 
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FIGURE 10.8 Experimentally measured pair correlation function g(r) and structure factor S(k) for liquid argon 
at 85 K. The structure factor (b) is determined from neutron scattering and the pair correlation function (a) is 
determined from the inverse Fourier transform of the structure factor. The small oscillations in g(r) near r = 0 are an 
experimental artifact of the Fourier transformation of the scattering data. This figure displays the typical features 
of correlations in fluids: nearly zero g(r) at short distances, large g(r) for particles separated by approximately a 
molecular diameter, oscillatory decay of correlations to unity at large separations, small S(k) at small wavevector 
due to the small compressibility of dense fluids, and S(fc) approaching unity at large wavevectors. Figures from 
Yarnell, Katz, Wenzel, and Koenig (1973). Reprinted with permission; copyright ©1973, American Physical Society. 


10.7.B Scattering from crystalline solids 

In an ideal crystalline solid, the atoms in the crystal are located at the sites of a periodic 
structure. For a simple crystal, identical atoms are sited on a Bravais lattice {R}. For exam¬ 
ple, a simple cubic lattice has lattice vectors R e \(ri]X + n 2 y+ n^z)a], where m, n 2 , and 
nj, are integers and a is the lattice constant. The reciprocal lattice {G} is defined by the 
set of reciprocal lattice vectors G — such that G ■ R — 2nm, where m is an integer for all 
{G} and {R}. The reciprocal lattice of the simple cubic lattice is also a simple cubic lat¬ 
tice: G e \ (ni\x + m 2 y + m 2 z )^ [, where m\, m 2 , and m 3 , are integers. For a perfect Bravais 
lattice, the structure factor S(lc) is of the form 



where q is the Kronecker delta. The structure factor is enhanced by a factor of N on 
each reciprocal lattice vector due to the coherent constructive interference of scattering 
from the long-range ordered array of atoms. One can determine the crystal structure of the 
solid from the experimental pattern of these sharp Bragg peaks; see Ashcroft and Mermin 
(1976). 

Thermal excitations cause atoms to deviate from their equilibrium positions. The dis¬ 
placed position of an atom whose equilibrium position is R can be written R + u{R), 
where u(R ) denotes the displacement from equilibrium. As long as the atoms remain 
close to their lattice sites, the sharp Bragg peaks in the structure factor will also remain 
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but the intensity of each peak will be reduced by an amount dependent on the aver¬ 
age of the squares of the deviations \u{R)\ 2 ). This turns out to be the case for normal 
three-dimensional solids. The structure factor then takes the form 

S(fc) = ^J2 e ‘ kiR ~ R) 

R,R' 

If the excitations about the equilibrium positions are Gaussian (i.e., the terms in the Hamil¬ 
tonian higher than second order in u{R) can be ignored), then the average of the deviations 
in the exponential can be simplified to give 

/ e ifc-(«(B)-H(R'))\ _ e -|(|fc (H(R)-«(R')l 2 )_ (24) 

If the displacements of the atoms far from each other on the lattice are uncorrelated, as 
they are in three-dimensional crystals, 

1 / 2 \ k 2 (u 2 ) 

2 yk- (u(R) — u(R )\ 2 j « — ' for | -s- oo, (25) 

then the structure factor takes the form 


J,k(u(R)—u{R! 




(23) 


S(k) = N WqS^c 
G 


(26) 


where 


W G = exp 



(27) 


is called the Debye-Waller factor. The random atomic deviations from lattice sites reduces 
the intensity in the Bragg peaks but the sharp scattering indicative of long-range crystalline 
order remains intact; see Ashcroft and Mermin (1976). 

An interesting variant of this calculation occurs in two-dimensional solids. Peierls 
(1935) and Landau (1937) showed that harmonic thermal fluctuations in two dimensions 
destroy crystalline long-range order. This was generalized by Mermin (1968) to show that 
long-range crystalline order was not possible for any two-dimensional system of parti¬ 
cles with short-range interactions. Two-dimensional solids exhibit power-law decay of 
translational correlations while maintaining long-range order in the lattice orientational 
correlations. This leads to power-law singularities rather than delta-functions in the static 
structure factor. It is possible for the solid to melt via two Kosterlitz-Thouless-like continu¬ 
ous transitions rather than a single first-order transition. The intervening “hexatic” phase 
exhibits short-range translational correlations and quasi-long-range orientational corre¬ 
lations; see Section 13.7, Kosterlitz and Thouless (1972,1973), Halperin and Nelson (1978), 
and Young (1979). 
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Problems 


10.1. For imperfect-gas calculations, one sometimes employs the Sutherland potential 


u(r) = 


oo 

- e(D/r ) 6 


for r < D 
for r > D. 


Using this potential, determine the second virial coefficient of a classical gas. Also determine 
first-order corrections to the ideal-gas law and to the various thermodynamic properties of the 
system. 

10.2. According to Lennard-Jones, the physical behavior of most real gases can be well understood if 
the intermolecular potential is assumed to be of the form 

u(r ) = ^ . 


where n is very nearly equal to 6 while m ranges between 11 and 13. Determine the second virial 
coefficient of a Lennard-Jones gas and compare your result with that for a van der Waals gas; see 
equation (10.3.8). 

10.3. (a) Show that for a gas obeying van der Waals equation of state (10.3.9), 


Cp-C v =Nk 


1 - 


2 a 
kTv 3 


(v-b) z 



(b) Also show that, for a van der Waals gas with constant specific heat Cy, an adiabatic process 
conforms to the equation 


(v — b)T Cv ! Nk = const; 


compare with equation (1.4.30). 

(c) Further show that the temperature change resulting from an expansion of the gas (into 
vacuum) from volume Vi to volume V 2 is given by 


Tz-Ti 


N 2 a / 1 

~c7 \V 2 



10.4. The coefficient of volume expansion a and the isothermal bulk modulus B of a gas are given by 
the empirical expressions 


a= - 1 


3 a' 
vT 2 


and B = P ( 1 


vT 2 


where a' is a constant parameter. Show that these expressions are mutually compatible. Also 
derive the equation of state of this gas. 

10.5. Show that the first-order Joule-Thomson coefficient of a gas is given by the formula 


= — l r 
3 P) H C P 


N ( rr 3{a 2 X i ) 
3 T 


- a 2 \' 


where a 2 (T ) is the second virial coefficient of the gas and H its enthalpy; see equation (10.2.1). 
Derive an explicit expression for the Joule-Thomson coefficient in the case of a gas with 
interparticle interaction 


I +oo 
-Mo 
0 


for 0 < r < D, 
for D <r < r\ } 
for n < r < oo. 


and discuss the temperature dependence of this coefficient. 
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10.6. Assume that the molecules of the nitrogen gas interact through the potential of the previous 
problem. Making use of the experimental data given next, determine the “best" empirical values 
for the parameters D, n, and ug/k: 

T (in K) I 100 200 300 400 500 

a 2 k 3 (in K per atm) -1.80 -4.26X 10- 1 -5.49 x 10' 2 +1.12xlO” 1 +2.05X10- 1 . 

10.7. Determine the lowest-order corrections to the ideal-gas values of the Helmholtz free energy, the 
Gibbs free energy, the entropy, the internal energy, the enthalpy, and the (constant-volume and 
constant-pressure) specific heats of a real gas. Discuss the temperature dependence of these 
corrections in the case of a gas whose molecules interact through the potential of Problem 10.5. 

10.8. The molecules of a solid attract one another with a force F(r) = a(l/r) 5 . Two semi-infinite solids 
composed of n molecules per unit volume are separated by a distance d, that is, the solids fill the 
whole of the space with x < 0 and x>d. Calculate the force of attraction, per unit area of the 
surface, between the two solids. 

10.9. Referring to equation (10.5.31) for the phase shifts i][(k) of a hard-sphere gas, show that for fcD « 1 

, (kP) 2l+l 

m ' C ' ( 2 /+ 1){1 ■ 3 ■ • ■ ( 2 / — l )} 2 ' 

10.10. Using the wavefunctions 

u p (r) = -J- e u P r)h 
s/V 

to describe the motion of a free particle, write down the symmetrized wavefunctions for a pair of 
noninteracting bosons/fermions, and show that 

(l',2'\U% /A \l,2) =±(2'|Wi|l)(l'|Wi|2). 

10.11. Show that for a gas composed of particles with spin J 

B 2 (7) = (/+ 1)(2/+ UB^O) +7(27 + 1)0 2 (0) 

and 

fif(7) = 7(2/ +1) ®2 (0) + (/ + 1) (2/ + 1) 82 (0) • 


10.12. Show that the coefficient S 2 for a quantum-mechanical Boltzmannian gas composed of 
“spinless” particles satisfies the following relations: 


g 2 = Lim 

/-+oO 


1 


(27+ l) 2 


8f(7) 


= Lim 

/—> OO 


1 


(27+1) 2 


KU) 


= -{C|(0) + #(0)}. 


Obtain the value of 0 2 , to fifth order in (D/X), by using the Beth-Uhlenbeck expressions in 
equations (10.5.36) and (10.5.37), and compare your result with the classical value of 0 2 , 
namely —(2n/3 )(D/X) i . 

10.13. Use a virial expansion approach to determine the first few nontrivial order contributions to the 
pair correlation function g(r) in d dimensions. Show that the pair correlation function is of the 
form g(r) = e~^ u( - r> y(r), where u(r) is the pair potential andy(r) is a smooth function of r. Show 
that even for the case of hard sphere interaction, y(r) and its first few derivatives are continuous. 

10.14. For the particular case of hard spheres, the pressure in the virial equation of state is determined 
by evaluating the pair correlation function at contact. Write the pair correlation function as 
g(r) = e~P u(r) y(r) and derive equations (10.7.12) for hard spheres in one, two, and three 
dimensions. 

[Hint-. For hard spheres, the Boltzmann factor e~P u( - r) is a Heaviside step function]. 
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10.15. Derive the probability distribution w{r) for the distance to the closest neighboring particle using 
the pair correlation function g(r) and the number density n. Show that in three dimensions 


u>{r) = 47tnr 2 g(r)exp | — J4irns 2 g(s)ds 


and the average closest-neighbor distance for an ideal gas is 



10.16. Consider a gas, of infinite extent, divided into regions A and B by an imaginary sheet running 
through the system. The molecules of the gas interact through a potential energy function u(r). 
Show that the average net force F experienced by all the molecules on the A-side of the sheet 
caused by all the molecules on the B-side are perpendicular to the plane of the sheet, and that its 
magnitude (per unit area] is given by 


F 

A 


2rtn 2 

3 


/(f) g(r)rW - 


10.17. 


Show that for a gas of noninteracting bosons, or fermions, the pair correlation function g(r) is 
given by the expression 

OO 2 

or r e npr)/h d 3 p 

g(r) = l± 6 / --— , 

° n 2 h 6 J e (p 2 / 2 m-M)/fcT -p I 

— OO 


where g s (= 2s + 1) is the spin multiplicity factor. Note that the upper sign here applies to bosons, 
the lower one to fermions. 14 

[Hint: To solve this problem, one may use the method of second quantization, as developed in 
Chapter 11. The particle density operator h is then given by the sum, 


J2 a l a /iK(r)Up(r), 

a,/3 


whose diagonal terms are directly related to the mean particle density n in the system. The 
nondiagonal terms give the density fluctuation operator (h — n), and so on; see equation (11.1.25).] 
10.18. Show that, in the case of a degenerate gas of fermions (T << Tp), the correlation function g(r), for 
r 3> h/pp, reduces to the expression 


g(r)-l = - 


3(rnkT) 2 

4pphr 2 


sinh ( 


mnkTr\ 

V PfH ) 


Note that, as T ->■ 0, this expression tends to the limiting form 

3fi 


g(r) - 1 = - 


1 

-o-7 OC — 7 . 

47 r z ppr i r 4 


14 Note that, in the classical limit ( h 0), the infinitely rapid oscillations of the factor exp(/(p- r)/h) make the integral 
vanish. Consequently, for an ideal classical gas, the function g(r) is identically equal to 1. Quantum-mechanical systems 
of identical particles exhibit spatial correlations due to Bose and Fermi statistics even in the absence of interactions. It is 
not difficult to see that, for n>? <SC 1 where A = h/^/(IrankT), 

g(r) ~ 1 ± T exp(—2:zrr 2 /A 2 ); 


compare with equation (5.5.27). 
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10.19. (a) For a dilute gas, the pair correlation function g(r) may be approximated as 

g(r) ~ exp{—u(r)/kT], 

Show that, under this approximation, the virial equation of state (10.7.11) takes the form 

OO 

PV f o 

-^ 1-2 nnjf(r)rdr, 

o 

where/(r) [= exp{— u(r)/kT] - 1] is the Mayer function, equation (10.1.6). 

(b) What form will this result take for a gas of hard spheres? Compare your result with that of 
Problem 1.4. 

10.20. Show that the pressure and Helmholtz free energy of a fluid at temperature T can be determined 
by performing a thermodynamic integration of the inverse of the isothermal compressibility from 
the chosen density to the ideal gas reference state. 

10.21. Show that, for a general Gaussian distribution of variables Uj, the average of the exponential of a 
linear combination of the variables obeys the relation 

| ex p(lZ a / M /)) = ex P 

10.22. Calculate the isothermal compressibility and Helmholtz free energy for the Carnahan-Starling 
equation of state (10.3.25) and show that the Helmholtz free energy is given by 

/3A _ /3A ideai | 77(4 - 377 ) 

IV “ N + (I-7?) 2 ’ 

where 7l ldeal is the Helmholtz free energy of a classical monatomic ideal gas at the same density. 

10.23. The virial expansion for a two-dimensional system of hard disks gives the following series when 
expressed in terms of the two-dimensional packing fraction 77 = ir 7 tD 2 / 4 : 

p 

= 1 + 2t? + 3.12801877 2 + 4.257854T) 3 + 5.336896647J 4 + 6.36302677 5 

nkT 

+ 7 . 35208077 6 + 8.3I866877 7 + 9.2723677 s + IO.2I6I77 9 + • • •; 

see Malijevsky and Kolafa (2008). Propose some simple analytical functions/( 77 ) that closely 
approximate this series. 








Statistical Mechanics 
of Interacting Systems: The 
Method of Quantized Fields 


In this chapter we present another method of dealing with systems composed of interact¬ 
ing particles. This method is based on the concept of a quantized field that is characterized 
by the field operators fir), and their hermitian conjugates f f (r), which satisfy a set of 
well-defined commutation rules. In terms of these operators, one defines a number oper¬ 
ator N and a Hamiltonian operator H that provide a suitable representation for a system 
composed of any finite number of particles and possessing any finite amount of energy. In 
view of its formal similarity with the Schrodinger formulation, the formulation in terms 
of a quantized field is generally referred to as the second quantization of the system. 
For convenience of calculation, the field operators f (r) and f\r) are often expressed as 
superpositions of a set of single-particle wavefunctions {u a (r)}, with coefficients a u and 
ah] the latter turn out to be the annihilation and creation operators, which again satisfy 
a set of well-defined commutation rules. The operators N and H then find a convenient 
expression in terms of the operators a a and ah , and the final formulation is well-suited 
for a treatment based on operator algebra; as a result, many calculations, which would 
otherwise be tedious, can be carried out in a more or less straightforward manner. 


11.1 The formalism of second quantization 

To represent a system of particles by a quantized field, we invoke the field operators fir) 
and fHr), which are defined for all values of the position coordinate r and which operate 
on a Hilbert space] a vector in this space corresponds to a particular state of the quantized 
field. The values of the quantities f and f f at all r, represent the degrees of freedom of the 
field; since r is a continuous variable, the number of these degrees of freedom is innumer¬ 
ably infinite. Now, if the given system is composed of bosons, the field operators f (r) and 
f Hr) satisfy the commutation rules 

[f(r),fHr')] = 8(r-r') (la) 

[fir), fir 1 )] = [fhr),fHr')] = 0, (lb) 
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where the symbol [A,B] stands for the commutator ( AB - BA) of the given operators A and 
B. If, on the other hand, the given system is composed of fermions, then the field operators 
satisfy the rules 


{Vdr),^V)} = <5(r-r') (2a) 

mr),ir (r')J = V)} = 0, (2b) 

where the symbol {A, B} stands for the anticommutator (AB + BA) of the given operators 
A and B. In the case of fermions, the operators ^(r) and V^(r) possess certain explicit 
properties that follow directly from (2b), namely 

i/r(r)i/r(r') = — \lf(r')ilr(r), ^r(r)^r(r) = 0 for all r; (2c) 


similarly, 


i l/^(r)i/f^(r')=—^(r^x/fHr), \jr^{r)^{r) = 0 for all r. (2d) 

Clearly, no such property holds for the field operators pertaining to bosons. In the sequel 
we shall see that the mathematical difference between the commutation rules (1) for the 
boson field operators and rules (2) for the fermion field operators is intimately related to 
the fundamental difference in the symmetry properties of the respective wavefunctions in 
the Schrodinger formulation. Of course, in their own place, both sets of rules, (1) and (2), 
are essentially axiomatic. 

We now introduce two hermitian operators, the particle-number operator N and the 
Hamiltonian operator H, through definitions that hold for bosons as well as fermions: 

N= f d 3 r^ t (r)^r(r) (3) 


and 


» h 2 r 

H = -/ d 3 n[r 1 (r)V 2 i/f (r) 

2m J 

+ \jj d 3 rid 3 r 2 \!f HriW(r 2 )u(ri } r2)i(r2)'lf(ri), (4) 

where u(ri,rz) denotes the two-body interaction potential in the given system. It is quite 
natural to interpret the product ^ as the number density operator of the field. The 

similarity between the foregoing definitions and the expressions for the expectation values 
of the corresponding physical quantities in the Schrodinger formulation is fairly obvious. 
However, the similarity is only “formal” because, while there we are concerned with the 
wavefunctions of the given system (which are c-numbers ), here we are concerned with 
the operators of the corresponding matter field. We can easily verify that, irrespective of 
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the commutation rules obeyed by the operators \Jr(r) and i/d(r), the operators N and H do 
commute: 


{N,H] = 0; 


(5) 


accordingly, the operators N and H can be diagonalized simultaneously. 

We now choose a complete orthonormal basis of the Hilbert space, such that any vec¬ 
tor |<J>rc) among the basis is a simultaneous eigenstate of the operators N and H. We 
may, therefore, denote any particular member of the basis by the symbol \^ne), with the 
properties 


Nl'l> ne) = N\'Vne), HI'Vne) = E^ne) 


( 6 ) 


and 


{'^neI'^ne) = 1- (7) 

The vector |^ 0 o)> which represents the vacuum state of the field and is generally denoted 
by the symbol |0>, is assumed to be unique; it possesses the obvious properties 

N\0)=H\0) = 0 and (0|0) = 1. (8) 

Next we observe that, regardless of whether we employ the boson commutation rules 
(1) or the fermion rules (2), the operator N and the operators i Js(r) and i/d(r) satisfy the 
commutation properties 


[i/r(r),lV] = i/r (r) and [i/d(r),A/] = — i/d(r), 


from which it follows that 


and 


Nf{r)\* m ) = (V(r)N-Vr(r)) \V m ) = (N— m(r)\^ NE ) 


(9) 


( 10 ) 


Clearly, the state ir(r)\^>NE) is also an eigenstate of the operator N, but with eigenvalue 
(N - 1); thus, the application of the operator i//(r) onto the state T.v/;) of the field anni¬ 
hilates one particle from the field. Similarly, the state iA t (r)|4'jvB) is an eigenstate of the 
operator N, with eigenvalue (N + 1); thus, the application of the operator ij/'ir) onto the 
state 14 >ne) of the field creates a particle in the field. In each case, the process (of annihi¬ 
lation or creation) is tied down to the point r of the field; however, the energy associated 
with the process, which also means the change in the energy of the field, remains undeter¬ 
mined; see equations (18) and (19). By a repeated application of the operator i/d onto the 
vacuum state |0), we find that the eigenvalues of the operator N are 0,1,2,.... 


348 Chapter 11 • Statistical Mechanics of Interacting Systems 


On the other hand, the application of the operator x/s onto the vacuum state |0) gives 
nothing but zero because, for obvious reasons, we cannot admit negative eigenvalues for 
the operator N. Of course, if we apply the operator i// onto the state I'I'ye) repeatedly N 
times, we end up with the vacuum state; we then have, by virtue of the orthonormality of 
the basis chosen, 


{4>„| Vr(n)^(r 2 )... 1r(.r N )\*NB) = 0 (12) 

unless the state |3>„) is itself the vacuum state, in which case we would obtain a nonzero 
result instead. In terms of this latter result, we may define a function of the N coordinates 
ri,r 2 ,...,r N , namely 


. ■ ■, rjv) = (7V!)-b 2 (0|i/r(n) ... MrNWm)- (13) 

Obviously, the function 'hjvfifri,..., rjv) has something to do with an assemblage of N par¬ 
ticles located at the points ri,..., rpj of the field because their annihilation from those very 
points of the field has led us to the vacuum state of the field. To obtain the precise meaning 
of this function, we first note that in the case of bosons (fermions) this function is sym¬ 
metric (antisymmetric) with respect to an interchange of any two of the N coordinates; 
see equations (lb) and (2b), respectively. Secondly, its norm is equal to unity, which can be 
seen as follows. 

By the very definition of Tty/Tr,,..., r,y), 

J d 3S r^(n,. .., r N )^>m(ri, ■ ■ ■, r N ) 

= (N!)~ 1 f d 3N r(^ m \^\r N )...^(ri)\0)m(ri)...^(r N )\^ NE ) 

= UV!)“ 1 f d 3N rJ2('i , NE\^(r N )...i,\n)\<t> n )(^ n \^(r 1 )...xlr(r N )\^ N E) 

^ n 

= (iV!)- 1 f d 3N r(^ NE \^(r N )... f \r 2 nhriW (n)f(r 2 ).. .f(r N )\^ NE y, 

here, use has been made of equation (12), which holds for all |<J>„) except for the vacuum 
state, and of the fact that the summation of |4>„)(O n | over the complete orthonormal set 
of the basis chosen is equivalent to a unit operator. We now carry out integration over n, 
yielding the factor 

J d 3 rn/fVi)Vdn) = N. 

Next, we carry out integration over r 2 , yielding the factor 

J d 3 r 2 f\r 2 )N^{r 2 ) = J d 3 r 2 ^ (r 2 )f(r 2 )(N - 1 ) = N(N- 1 ); 
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see equation (10). By iteration, we obtain 


J d™ r'i’* N[; (r\,.. .,r N )^ NE {n,. ,.,r N ) 

= (JV!) _1 ('I'jvE|iV(lV — l)(iV — 2) ... up to AT factors| ^/ve) 
= ( NI)~ 1 NI{'Iinb\^ne ) = 1 . 


(14) 


Finally, we can show that, for bosons as well as fermions, the function l. ■ • • > r ,\ ! ) 
satisfies the differential equation, see Problem 11.1, 



which is simply the Schrodinger equation of an )V-particle system. The function 
4 / jv£(JT, - >rjv) is, therefore, the Schrodinger wavefunction of the system, with energy 
eigenvalue E; accordingly, the product is the probability density for the particles 

of the system to be in the vicinity of the coordinates (ri,...,rjv)> when the system hap¬ 
pens to be in an eigenstate with energy E. This establishes the desired correspondence 
between the quantized field formulation and the Schrodinger formulation. In passing, we 
place on record the quantized-field expression for the function ,... ,rjv), which is 

the complex conjugate of the wavefunction T> ■ ■ ■ > rjv), namely 


*jvB(ri,. = (iV!)- 1/2 ('I/ JV£ |^ t (r N )... f f (ri)|0). 


(16) 


We now introduce a complete orthonormal set of single-particle wavefunctions u a (r), 
where the suffix a provides a label for identifying the various single-particle states; it 
could, for instance, be the energy eigenvalue of the state (or the momentum p, along 
with the spin component a pertaining to the state). In view of the orthonormality of these 
wavefunctions, 



(17) 


The field operators i jr{r) and \l/ f (r) may now be expanded in terms of the functions u a (r): 


fir) = 


(18) 


a 


and 


fUr) = J2 a a u *c*(r)- 


(19) 
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Relations inverse to equation (18) and (19) are 

a a = J d 3 rijr(r)u*(r) (20) 

and 

a} a = J d 3 r-j/ f (r)u a (r). (21) 

The coefficients a a and ah, like the field variables \!/(r) and ffr), are operators that oper¬ 
ate on the elements of the relevant Hilbert space. Indeed, the operators a a and a f a now take 
over the role of the degrees of freedom of the field. 

Substituting (18) and (19) into the set of rules (1) or (2), and making use of the closure 
property of the u a , namely 


^2 u a (r) if (r') =8(r- r'), (22) 


we obtain 1 for the operators a a and a a the commutation relations 


'Cl 

a 

oo 

II 

(23a) 

\CL a ,Clp] — \Q*ol> Clp ] — 0 

(23b) 

in the case of bosons , and 


{Cion Clp] — S a fi 

(24a) 

{ Clo /, Clfi } — {CLcnClp} — 0 

(24b) 

in the case of fermions. In the latter case, the operators a a and al 
properties that follow directly from (24b), namely 

possess certain explicit 

CLoiCip = —o-ffUci, a a a a = 0 for all cc, 

(24c) 

similarly 


CL(yCio — a*da, . . oiaCia — 0 for alia. 

(24d) 


No such property holds for operators pertaining to bosons. We will see very shortly that 
this vital difference between the commutation rules for the boson operators and those for 
the fermion operators is closely linked with the fact that while fermions have to conform 
to the restrictions imposed by the Pauli exclusion principle, there are no such restrictions 
for bosons. 


Alternatively, one may employ equations (20) and (21), and make use of rules (1) or (2) along with equation (17). 
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We now proceed to express operators N and H in terms of a a and ah. Substituting (18) 
and (19) into (3), we obtain 

N= d 3 rYh a iap u* a (r) up (r) = ^ a a ap 8 af} 

J <X,P a,p 

— cIqi . (25) 

a 

It seems natural to speak of the operator ala a as the particle-number operator pertaining 
to the single-particle state a. We denote this operator by the symbol N a : 

N a = ah a a . (26) 

It is easy to verify that, for bosons as well as fermions, the operators N a commute with one 
another; hence, they can be simultaneously diagonalized. Accordingly, we may choose a 
complete orthonormal basis of the Hilbert space in such a way that any vector belonging 
to the basis is a simultaneous eigenstate of all the operators N a - 2 Let a particular member 
of the basis be denoted by the vector \no,n\,...,n a ,...), or by the shorter symbol |3>„), with 
the properties 

N a \®n) = n a \<S> n ) (27) 


and 


(4>„|<D„) = 1 ; (28) 

the number n a , being the eigenvalue of the operator N a in the state |3>„) of the field, 
denotes the number of particles in the single-particle state a of the given system. One par¬ 
ticular member of the basis, for which n a = 0 for all a, will represent the vacuum state of 
the field; denoting the vacuum state by the symbol | T () ), we have 

N a | <t>o) = 0 forallff, and <<J> 0 1 < t > o) = 1 - (29) 

Next we observe that, regardless of whether we employ the boson commutation rules 
(23) or the fermion rules (24), the operator N a and the operators a u and a u satisfy the 
commutation properties 


\cIqi , IV 7 ] — u a and [a a , IV y ] — ci u , 


(30) 


from which it follows that 


Nadu 1 — (ci a N a — ct a )\^>n) — (n a — l)^a'|4 > ,;) 


( 31 ) 


2 This representation of the field is generally referred to as the particle-number representation. 
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and 


N a Cl a \^n) —(rZo-A/a + Oa)\^n) — (n a A (32) 

Clearly, the state a a |O n ) is also an eigenstate of the operator N a , but with eigenvalue 
(n a — 1); thus, the application of the operator a„ onto the state |3>„) of the field annihi¬ 
lates one particle from the field. Similarly, the state ah |4>„) is an eigenstate of the operator 
N a , with eigenvalue (n a + 1); thus, the application of the operator a u onto the state |<t>„) 
creates a particle in the field. The operators a a and a u are, therefore, referred to as the 
annihilation and creation operators. Of course, in each case the process (of annihilation 
or creation) is tied down to the single-particle state a; however, the precise location of the 
event (in the coordinate space) remains undetermined; see equations (20) and (21). Now, 
since the application of the operator a a or ah onto the state | <t> n ) of the field does not affect 
the eigenvalues of the particle-number operators other than N a , we may write 

a a \no,ni,= A(n a )\no,ni,...,n a - 1,...) (33) 

and 

al\n 0t B(n a )\n 0 , m ,..., n a + 1, ...), (34) 

where the factors A{n a ) and B(n a ) can be determined with the help of the commutation 
rules governing the operators a a and ah . For bosons, 

A(n a ) = Jn a , B(n a ) = ^/(n a + 1); (35) 

consequently, if we regard the state |O n ) to have arisen from the vacuum state <t> 0 ) by a 
repeated application of the creation operators, we can write 

1 + fin f Hi + ll a 

l<t»«) = ,, , , -—(ab V|) ■■■(<* ■ • • l^o). (36) 

In the case of fermions, the operators ah anticommute, with the result that aha'., — —a^ah; 
consequently, there would remain an uncertainty of a phase factor ±1 unless the order 
in which the a' a operate on the vacuum state is specified. To be definite, let us agree 
that, as indicated in equation (36), the ah are arranged in the order of increasing sub¬ 
scripts and the phase factor is then +1. Second, since the product ah ah now vanishes, 
none of the n a in (36) can exceed unity; the eigenvalues of the fermion operators N a are, 
therefore, restricted to 0 and 1, which is precisely the requirement of the Pauli exclusion 
principle. 3 Accordingly, the factor [Yl a (n a \)}~ 112 in (36) would be identically equal to unity. 

3 This can also be seen by noting that the fermion operators N a satisfy the identity 

Nl = dl a a a d' (Jt a a =al( 1 - = <A a a a = N a (since <A a <A a a a a a = 0). 

The same would be true of the eigenvalues n a . Hence, nh = n a , which means that n a — 0 or 1. 
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In passing, we note that in the case of fermions, operation (33) has meaning only if n a — 1 
and operation (34) has meaning only if n a — 0. 

Finally, the substitution of expressions (18) and (19) into (4) gives for the Hamiltonian 
operator of the field 



2m 


J](a|V 2 | P)alcif> + ^ J2 (a/3\u\y X) alcipCiy ax, 

ot,fi a,p,y,X. 


(37) 


where 

<or | V 2 1 /?) = J d 3 ru*(r)V 2 up(r) 


and 


(38) 


(ap\u\yX)= jj d 3 rid 3 r 2 u* a (ri)u*p(r 2 )ui 2 Uy(r 2 )Ux(ri). (39) 

Now, if the single-particle wavefunctions are chosen to be 

u a(r)= -^ elPa ' r/h , (40) 

where p a denotes the momentum of the particle (assumed “spinless”), then the matrix 
elements (38) and (39) become 

( 2 \ 2 

_ p A e i pp r,h = _ p ^ &a p (41) 

and 

(ap\u\yk) = ^ 2 jj d 3 r 1 d 3 r 2 e- i ^-P^ h u(r 2 - r 1 )e~ Up f l - p y ) ' r2/h . (42) 

In view of the fact that the total momentum is conserved in each collision, 

Pa+Pfi=Py+PX' ( 43 ) 

the matrix element (42) takes the form 

(af}\u\yX) — -^2 JJ d 3 rid 3 r 2 e l{p v- p P ){r2 ~ ri)lh u(r 2 -r\) 

= ^ J d 3 re lpr/n u(r), (44) 

where p denotes the momentum transfer during the collision: 


P=(Py-Pf,)=-(Px-P a )- 


( 45 ) 
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Substituting (41) and (44) into (37), we finally obtain 



( 46 ) 


where ify*p 2 denotes the matrix element (44), with 


P=(P2-P , 2') = -(P\ -p'i); 


(47) 


note that the primed summation in the second term of (46) goes only over those values of 


P\PiP \, and p' 2 that conserve the total momentum of the particles: p\ + p' 2 = p l + p 2 . It is 


obvious that the main term in (46) represents the kinetic energy of the field (nj,n p being the 


particle-number operator pertaining to the single-particle state p), while the second term 
represents the potential energy. 

In the case of spin-half fermions, the single-particle states have to be characterized not 
only by the value p of the particle momentum but also by the value o of the z-component of 
its spin; accordingly, the creation and annihilation operators would carry double indices. 
The operator H then takes the form 



(48) 


the summation in the second term now goes only over those states (of the two particles) 
that conform to the conditions of both momentum conservation and spin conservation. 

In the following sections we shall apply the formalism of second quantization to inves¬ 
tigate low-temperature properties of systems composed of interacting particles. In most 
cases we shall study these systems under the approximating conditions a/X 1 and 
na 3 1, where a is the scattering length of the two-body interaction, X the mean thermal 
wavelength of the particles, and n the particle density in the system. Now, the effective 
scattering cross-section for the collision of two particles, each of mass m, is primarily 
determined by the “scattering amplitude’’ a(p), where 


a(p) = f u(r)e ip r l h d 3 r, 


(49) 


p being the momentum transfer during the collision; if the potential is central, equa¬ 
tion (49) takes the form 


OO 



m f 


(50) 


0 


For low-energy scattering (which implies “slow” collisions), we have the limiting result 



( 51 ) 
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the quantity a being the scattering length of the given potential. 4 Alternatively, one may 
employ the S-wave scattering phase shift iio(k), see Section 10.5, and write on one hand 

OO y 

, mk C sin (kr) ? , 

tan;;o(A:) ~ j u(r) (fa . )2 4 nr dr (52) 

o 


and on the other 


cot;? 0 (k) = -— + -kr* + ---, (53) 

ka 2 

where a is the “scattering length’’ and r* the “effective range” of the potential. For low- 
energy scattering, equations (52) and (53) once again lead to (51). In passing, we note that 
a is positive or negative according as the potential in question is predominantly repulsive 
or predominantly attractive; unless a statement is made to the contrary, we shall assume a 
to be positive. 


11.2 Low-temperature behavior of an imperfect 
Bose gas 

The Hamiltonian of the quantized field for spinless bosons is given by the expression 
(11.1.46), where the matrix element Upj’pi; is a function of the momentum p transferred 
during the collision and is given by formula (11.1.44). At low temperatures the particle 
momenta are small, so we may insert for the matrix elements u(p) their value at p = 0, 
namely Uo/V, where uq is given by equation (11.1.51). At the same time, we may retain 
only those terms in the sum J2' that pertain to a vanishing momentum transfer. We then 
have 


H = ala a + 

9m P p 


2 m 

p 

+ E ( 

Pl^P2 


2nah 2 
mV 


P 


it t t ' 

a pi (t p dp,cip^ ! f/p f/p. cipya Pl 


( 1 ) 


Now 


^flpflpflp( 2 p = ^flp(flpflp - l)flp = ^(«p- n p ) = Y2 n p-N, (2) 

p p P P 

4 This result is consistent with the pseudopotential approach of Huang and Yang (1957) in which u(r) is replaced by 
the singular potential (4 nah 2 /m)&(r), so the integral Ug becomes 4naH 2 /m. For an exposition of the pseudopotential 
approach, see Chapter 10 of the first edition of this book. 
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whereas 


Z 4, a k a P2 a Pi= E n Pi n Pi = E n Pi {N - n Pi )= Nl - E n P ■ (3) 

Pl^P2 Pl^Pl Pi P 

the same being true of the sum over the exchange terms a) p a) p ap 2 a p r Collecting these 
results, the energy eigenvalues of the system turn out to be 

2N 2 - N-J2 n2 p 

P 


P 

E{n P } =E n P7^ + 


|2 2 nah 2 

2m ' mV 



2 nah 2 
mV 


(2N 2 - n 2 ). 


(4) 5 


We first examine the ground state of the given system, which corresponds to the dis¬ 
tribution set 


| At for p = 0 
10 for p^ 0, 


(5) 


with the result that 


E 0 - 


2nah 2 N 2 

mV 


( 6 ) 


The ground-state pressure is then given by 


/ 3 Eq\ 2nah 2 N 2 

(vWZ" mV 2 


2jrah 2 n 2 
m 


(7) 


where n(= N/V ) is the particle density in the system. This leads to the velocity of sound, 
c 0 , given by 


9 1 dPn 4nah 2 n 

c 2 =--E = - 5 —. (8) 

m, an m z 

Inserting numbers relevant to liquid He 4 , namely a ~ 2.2A, n= l/v where v ~ 45 A 3 and 
6.65 x 10 _24 g, we obtain: Co — 125m/s. A comparison with the actual velocity of 
sound in the liquid, which is about 240 m/s, should not be too disheartening, for the theory 

5 In the last step we have replaced the sum J2p n p by the single term njj, thus neglecting the partial sum J2p^o n p ' n 
comparison with the number (2 N 2 — rifa). Justification for this step lies in the fact that, by the theory of fluctuations, the 
neglected part here will be O(N), and not 0(N 2 ). 
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developed here was never intended to be applicable to a liquid. Finally, the chemical 
potential of the system at T = 0 K turns out to be 

/ dEn\ Anah 2 N 4nah 2 n 

fl0= \w) v = mV m ' (9) 

At finite but low temperatures, the physical behavior of the system maybe studied through 
its partition function 


Q(N, V,T) = J2 exp (~pE[n p }) 

i n P } 


= E ex p 

{ n p ) 




2nah 2 N 2 
mV 


2 - 


N 2 


( 10 ) 


In the lowest approximation, the quantity (rio/N) appearing here may be replaced by its 
ideal-gas value, as given in Section 7.1, namely 


«o , 

N X 3 
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V C 


X = 
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(: 2nmkT) 1 ! 2 


, X c ={vt;(3/2)}'/ 3 


X 6 


V 

V — — , Vr = 

N 4TC3/2) 


(11a) 

(lib) 


We thus obtain, to first order in a, 


, 2nah 2 N 2 ( 2v v 2 

In QOV, L, T) ~ In Q id (N, V,T)-f$ 1 + --^ 

mV \ zy zz? 


( 12 ) 


The Helmholtz free energy, per particle, is then given by 6 

1 kT, 1 2nah 2 (\ 2 v 

-A(N, V, T ) = - In QOV, V, T) ~ A id (N, V, T) +-( - +- 2 

N N N m \v v c v 2 


(13) 


The pressure P and the chemical potential /x now follow straightforwardly: 


T=-l^ =- 

) NT 


/ d (A/N) \ 

V dv ) T 


2nah 2 ( 1 1 , 

— P id 3--— ( “9 + I > 


m V v Vi 


(14) 


and 


A n 
V — + P v — I 2 id + 


An ah 2 /I 1 


v cJ 


(15) 


which may be compared with the ground-state results (7) and (9) that pertain to v c — oo. 


6 This and the subsequent results were first derived by Lee and Yang (1958, 1960c) using the binary collision method 
and by Huang (1959, 1960) using the pseudopotential method. 
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At the transition point (where v~v c and k = k c ), the pressure P c and the chemical 
potential p c turn out to be 


Pc — Pid + 


An ah 2 
mXc 




(16) 


and 


8nah 2 

M-c = Mid H-? 

mk% 




the corresponding value of the fugacity, z c , is given by 


(17) 


z c = exp(/x c /fcT c ) ~ 1 + 4((3/2)(a/k c ). (18) 

For a slightly different approach to this result, see Problem 11.2. 


11.2.A Effects of interactions on ultracold atomic 
Bose-Einstein condensates 

In Section 7.2 we discussed Bose-Einstein condensation of noninteracting bosons con¬ 
fined in magnetic traps. The low-energy interactions between atoms are described by the 
scattering length a, see equation (11.1.51), and the effect of the scattering length on the 
Bose-condensed ground state of a uniform gas is described by equations (6) through (9). 
We can include the effect of atomic interactions on the spatially nonuniform ground state 
using the Gross-Pitaevskii equation; see Pitaevskii (1961), Gross (1961, 1963), Pitaevskii 
and Stringari (2003), and Leggett (2006). The magnetic trap potential can be approximated 
by an anisotropic harmonic oscillator potential 

V(r) = ^m(a> 2 x 2 + ct) 2 y 2 + a>fz 2 ), (19) 

which leads to the unperturbed single-particle ground-state wavefunction 




1 / x 2 y 2 z 2 \ 
2\a 2 + a 2 + a 2 ) 


( 20 ) 


where a a — ^/h/rri(o a is the linear size of the unperturbed harmonic oscillator ground state 
in Cartesian direction a. 

For the noninteracting case at T — 0, all the N atoms in the trap occupy this same single¬ 
particle state to form a macroscopic quantum state 'l'(r) = y/N<j>(r). At low energies, the 
interactions can be approximated by an effective contact potential uqS (r — r') with scat¬ 
tering length a and coupling u 0 — Anah 2 /rn. This provides a fairly accurate description 
of interactions in ultracold gases since the kinetic energies and densities of the particles 
are so small; see equation (11.1.51). If the scattering length a is positive, the interaction is 
repulsive while if a is negative the interaction is attractive. For atoms, the scattering length 
is normally of the order of a few Bohr radii but in some atomic isotopes the scattering 
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length can be tuned over a large range, including a change of sign, with only small changes 
in the magnetic field via a Feshbach resonance. When interactions are included, the mean 
field energy can be written in terms of the macroscopic quantum-state wavefunction 
'L(r), where the macroscopic ground-state number density is given by n(r) — |^(r)| 2 . The 
Gross-Pitevskii energy functional then is 


EW = / dr 


!■ 


h 2 

2m 


|V4qr)| 2 + V(r)|'I'(r)|" 


+ 2 M o l'L(r)f 


( 21 ) 


see Bahm and Pethick (1996), Pitaevskii and Stringari (2003), Leggett (2006), and Pethick 
and Smith (2008). The energy E[T] can be minimized with respect to T*, with the 
constraint 


N = J n(r)dr = j |'I'(r)| 2 dr, (22) 

using a Lagrange multiplier /i. Setting SE - /j,8N — 0 gives the Gross-Pitaevskii equation 

h 2 

-V 2 »L(r) + P(r)<T(r) + u 0 |T(r)[ 2 4qr) = ^(r). (23) 

2m 

Equations (21) and (23) are quite appropriate for describing the zero-temperature 
nonuniform Bose gas when the scattering length a is much smaller than the average spac¬ 
ing between the particles. Equation (23) is in the form of a single-particle Schrodinger 
equation with the addition of a nonlinear term proportional to nol'k(r)| 2 that gives a 
mean field coupling of one particle to all the remaining particles in the condensate. To 
determine the mean field ground-state energy, one solves equation (23) for 'k(r) and uses 
that solution to evaluate (21). When a = 0, the solution is the noninteracting ground- 
state wavefunction 'k(r) = y^N(p(r). The dimensionless parameter that controls the size 
of the interaction term is Na/a osc where a osc = (f/i flails) 1 ' 33 , so the effects of interactions 
are largest in systems with large numbers of atoms in the condensate. 

Equation (23) can be analyzed analytically in several limiting cases; it can also be stud¬ 
ied numerically. In particular, solving equations (21) and (23) for a uniform system with 
V — 0 reproduces the results given in equations (6) through (9). For a > 0, the conden¬ 
sate wavefunction expands in every direction relative to the noninteracting Bose-Einstein 
condensate. For Na/a osc » 1, the kinetic energy term can be neglected in the style of a 
Thomas-Fermi analysis, so the wavefunction becomes approximately 

't'(r)«V(/i-V r (r))/u 0 , (24) 

where the Thomas-Fermi wavefunction vanishes for V(r) > /i. The chemical potential // 
and the number of bosons N are related by 



mu>Q J 


3/2 


Uo 


(25) 
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where <mq = (&>i&> 2 « 3 ) 1 / 3 , so 


M = 



(26) 


The total energy of the condensate in this limit is 



(27) 


The linear extents of the condensate in the three directions of the trap are given by 



(28) 


so the repulsive interactions expand the size of the condensate, making the aniostropy of 
the system larger than that of the noninteracting Bose-Einstein condensate; see Pitaevskii 
and Stringari (2003); Leggett (2006), and Pethick and Smith (2008). In time-of-flight mea¬ 
surements, the repulsive interactions result in higher velocities in the directions that were 
most confined in the trap, so the time-of-flight distributions are also more anisotropic than 
in the noninteracting case; see Holland and Cooper (1996) and Holland et al. (1997). 

For attractive interactions with negative scattering lengths, the condensate is ultimately 
unstable because of the formation of pairs, but a long-lived atomic condensate exists for 
small negative Na/a osc . For the isotropic case, the atomic condensate is metastable in the 
mean field theory for —0.575 < Na/a osc < 0. Even in the anisotropic case, the condensate is 
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FIGURE 11.1 Ground-state energy of a Bose-Einstein condensate in an isotropic harmonic trap as a function of the 
scattering length a. The energy is plotted in units of Nha. >o, where mq is the trap frequency. The "nonlinear 
constant" is proportional to Na/a osc , where N is the number of atoms and a 0sc = Jh/ma >o is the width of the 
harmonic oscillator ground state wavefunction. Figure from Ruprecht et al. (1995). Reprinted with permission; 
copyright © 1995, American Physical Society. 
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nearly spherical since the solution of equation (23) is dominated by the kinetic and inter¬ 
action terms; see Ruprecht et al. (1995) and Figure 11.1. Roberts et al. (2001) have used 
a Feshbach resonance to tune the scattering length of 85 Rb to find that the condensate 
becomes unstable at N\a\/a osc — 0.46. 

11.3 Low-lying states of an imperfect Bose gas 

In the preceding section we examined first-order corrections to the low-temperature 
behavior of an imperfect Bose gas arising from interparticle interactions in the system. 
One important result emerging in that study was a nonzero velocity of sound, as given 
by equation (11.2.8). This raises the possibility that phonons, the quanta of sound field, 
might play an important role in determining the low-temperature behavior of this system 
— a role not seen in Section 11.2. To look into this question, we explore the nature of the 
low-lying states of an imperfect Bose gas, in the hope that we thus discover an energy- 
momentum relation sip) obeyed by the elementary excitations of the system, of which 
phonons may be an integral part. For this, we have to go a step beyond the approximation 
adopted in Section 11.2 which, in turn, requires several significant improvements. To keep 
matters simple, we confine ourselves to situations in which the fraction of particles occu¬ 
pying the state with p — 0 is fairly close to 1 while the fraction of particles occupying states 
with p^ 0 is much less than 1. 

Going back to equations (11.2.1) through (11.2.4), we first write 

2 N 2 -nl = N 2 + (N 2 - n%) ~ N 2 + 2 N(N - n 0 ) = N 2 + 2N ^ a ] p a p . (1) 

p^O 

Next, we retain another set of terms from the sum i n equation (11.1.46) — terms that 
involve a nonzero momentum transfer, namely 


^ ll(p) \Cl p Cl p Cl()Cl{) -(- CIqCIqCI p CL— P \ . ( 2 ) 

p^O 


Now, since a' Q ao = no — O(N) and (aoa f) - a' Q ao) = 1 « N, il follows that aoaj = (no + 1) — 
a' Q ao. The operators ao and a' Q may, therefore, be treated as c-numbers, each equal to n l J 2 ~ 
AT 1 / 2 . At the same time, the amplitude u(p) in the case of low-lying states may be replaced 
by u 0 /V, as before. Expression (2) then becomes 

U 0 N yr —> + + 

— y~ / “ 1 “ a p a —p)' 

P* o 


(3) 
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In view of these results, the Hamiltonian of the system assumes the form 


TT X -' P 

H = > —— a„a v 
^ 2m p p 


u 0 

2V 


N “l - iV (2a p a p -l- a p ci _ p -l- a p ci _ p ) 

Pit o 


(4) 


Our next task consists of determining an improved relationship between the quantity 
u 0 and the scattering length a. While the (approximate) result stated in equation (11.1.51) 
is good enough for evaluating the term involving NJ 2 P ^ 0 ’ it i s n °t so for evaluating the 
term involving N 2 . For this, we note that “if the probability of a particular quantum tran¬ 
sition in a given system under the influence of a constant perturbation V is, in the first 
approximation, determined by the matrix element Vj, then in the second approximation 
we have instead 


<+E 

0 


V°V n 

v n 

Eq — E n ’ 


the summation going over the various states of the unperturbed system.” 

In the present case, we are dealing with a collision process in the two-particle system 
(with reduced mass \m), and the role of Vj is played by the quantity 

u oo = ^f w<r)d 3 r=y; 

see equation (11.1.44) for the matrix element u p \[ p 2 2 ■ Making use of the other matrix ele¬ 
ments, we find that in going from the first to second approximation, we have to replace 
iio/Vby 


y + v 2 2 - 


\ J cpre lpr/h u(r)\ 2 u 0 u^m 


p^o 


—p z /m 
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V V 2 t 


(5) 


Equating (5) with the standard expression An ah 2 /mV, we obtain, instead of (11.1.51), 


M 0 ~ 
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Anah 2 
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( 6 ) 


Substituting (6) into (4), we get 

2nah 2 N 2 
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(7) 
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To evaluate the energy levels of the system one would have to diagonalize the Hamilton¬ 
ian (7), which can be done with the help of a linear transformation of the operators a p and 
af, first employed by Bogoliubov (1947): 


bp — 


Up + a,pU_p 

V(l-»p) 


bp — 


Up + CXpU—p 

yo - op) 


where 


with 


mV ( 4jt ah 2 N p 2 

" p = An ah' 2 -N j mV + 2m~ ® (P) 


sip) = 
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1/2 


( 8 ) 


(9) 


( 10 ) 


clearly, each a p < 1. Relations inverse to (8) are 


Up — 


bp — cipb^p 

V(1 -“p) 



b p — ctpb—p 

V(l-a#) 


( 11 ) 


It is straightforward to check that the new operators b p and b p satisfy the same commuta¬ 
tion rules as the old operators a p and a p did, namely 

lbp,b p ,}=8 p p, (12a) 

[b p , b p f] = [b l pt b p ,\ = 0. (12b) 

Substituting (11) into (7), we obtain our Hamiltonian in the diagonalized form: 


H = E 0 + J2 e( P )b pbp’ 

p^O 


(13) 


where 


£o = 
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mV 
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s(P)~n - 

2m 
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mV 




mV 
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(14) 


In view of the commutation rules (12) and expression (13) for the Hamiltonian oper¬ 
ator H, it seems natural to infer that the operators b p and b p are the annihilation and 
creation operators of certain “quasiparticles” — which represent elementary excitations of 
the system — with the energy-momentum relation given by (10); it is also clear that these 
quasiparticles obey Bose-Einstein statistics. The quantity b p bp is then the particle-number 
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operator for the quasiparticles (or elementary excitations) of momentum p, whereby the 
second part of the Hamiltonian (13) becomes the energy operator corresponding to the 
elementary excitations present in the system. The first part of the Hamiltonian, given 
explicitly by equation (14), is therefore the ground-state energy of the system. Replacing 
the summation over p by integration and introducing a dimensionless variable x, defined 
by 


x = p 


V 


y 8jtah 2 N^) 

we obtain for the ground-state energy of the system 
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The value of the integral turns out to be (128) 1 / 2 /15, with the result 
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(16) 


where n denotes the particle density in the system. Equation (16) represents the first 
two terms of the expansion of the quantity Eq/N in terms of the low-density parameter 
(na 3 ) 1 / 2 ; the first term was already obtained in Section 11.2. 7 

The foregoing result was first derived by Lee and Yang (1957) using the binary collision 
method; the details of this calculation, however, appeared somewhat later (see Lee and 
Yang, 1960a; see also Problem 11.6). Using the pseudopotential method, this result was 
rederived by Lee, Huang, and Yang (1957). 

The ground-state pressure of the system is now given by 


/ dEp\ _ 2 d(Ep/N) 
V dV) N 3 n 


2nah 2 n 2 
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(17) 


7 The evaluation of higher-order terms of this expansion necessitates consideration of three-body collisions as well; 
hence, in general, they cannot be expressed in terms of the scattering length alone. The exceptional case of a hard-sphere 
gas has been studied by Wu (1959), who obtained (using the pseudopotential method) 
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which shows that the expansion does not proceed in simple powers of (na 3 ) 1 / 2 . 
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from which one obtains for the velocity of sound 


1 dPo 
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Equations (17) and (18) are an improved version of the results obtained in Section 11.2. 

The ground state of the system is characterized by a total absence of excitations; 
accordingly, the eigenvalue of the (number) operator b p b p of the quasiparticles must be 
zero for all p^ 0. As for the real particles, there must be some that possess nonzero ener¬ 
gies even at absolute zero, for otherwise the system cannot have a finite amount of energy 
in the ground state. The momentum distribution of the real particles can be determined 
by evaluating the ground-state expectation values of the number operators a p a p . Now, in 
the ground state of the system, 
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because b p \Ai 0 ) = 0. Constructing the hermitian conjugate of (19) and remembering that 
a p is real, we have 
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The scalar product of expressions (19) and (20) gives 
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here, use has been made of the facts that (i) b p b p - b p b p — 1 and (ii) in the ground state, 
for all p V 0, b p b p = 0 (and hence b p b p = 1). Thus, for p / 0, 


a p x 2 + l 1 

1 -a 2 = 2Xy/(X 2 + 2) ~~ 2’ 


( 22 ) 


where x = p(8itah 2 n) b 2 . The total number of “excited” particles in the ground state of 
the system is, therefore, given by 
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The value of the integral turns out to be (2) 1 / 2 /3, with the result 

® (24) 

p^o n 


Accordingly, 
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The foregoing result was first obtained by Lee, Huang, and Yang (1957), using the pseu¬ 
dopotential method. It may be noted here that the importance of the real-particle occupa¬ 
tion numbers n p in the study of the ground state of an interacting Bose system had been 
emphasized earlier by Penrose and Onsager (1956). 


11.4 Energy spectrum of a Bose liquid 

In this section we propose to study the most essential features of the energy spectrum of a 
Bose liquid and to examine the relevance of this study to the problem of liquid He 4 . In this 
context we have seen that the low-lying states of a low-density gaseous system composed 
of weakly interacting bosons are characterized by the presence of the so-called elementary 
excitations (or “quasiparticles”), which are themselves bosons and whose energy spectrum 
is given by 


s(p) = {p 2 u 2 + (p 2 /2m) 2 } 1/2 , 


( 1 ) 


where 


u— {An an) 1 ! 2 {h/m)', (2) 

see equations (11.3.10), (11.3.12), and (11.3.13). 8 For p«ma, that is, p <^h(an) l/2 , the 
spectrum is essentially linear: s — pu. The initial slope of the (s,p )-curve is, therefore, given 
by the parameter u, which is identical to the limiting value of the velocity of sound in the 
system; compare (2) with (11.3.18). It is then natural that these low-momentum excita¬ 
tions be identified as phonons — the quanta of the sound field. For p » mu, the spectrum 
approaches essentially the classical limit: e ~ p 2 /2m + A*, where A* = mu 2 — Ananh 2 /m. 
It is important to note that, all along, this energy-momentum relationship is strictly mono¬ 
tonic and does not display any “dip” of the kind propounded by Landau (1941, 1947) (for 


8 Spectrum (1) was first obtained by Bogoliubov (1947) by the method outlined in the preceding sections. Using the 
pseudopotential method, it was rederived by Lee, Huang, and Yang (1957). 
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liquid He 4 ) and observed experimentally by Yarnell et al. (1959), and by Henshaw and 
Woods (1961); see Section 7.6. Thus, the spectrum provided by the theory of the preceding 
sections simulates the Landau spectrum only to the extent of phonons; it does not account 
for rotons. This should not be surprising, for the theory in question was intended only for 
a low-density Bose gas (na 3 <$; 1) and not for liquid H e 4 (na 3 — 0.2). 

The problem of elementary excitations in liquid He 4 was tackled successfully by 
Feynman who, in 1953 to 1954, developed an atomic theory of a Bose liquid at low tem¬ 
peratures. In a series of three fundamental papers starting from first principles, Feynman 
established the following important results. 9 

(i) In spite of the presence of interatomic forces, a Bose liquid undergoes a phase 
transition analogous to the momentum-space condensation occurring in the ideal 
Bose gas; in other words, the original suggestion of London (1938a,b) regarding 
liquid He 4 , see Section 7.1, is essentially correct. 

(i i) At sufficiently low temperatures, the only excited states possible in the liquid 

are the ones related to compressional waves, namely phonons. Long-range motions, 
which leave the density of the liquid unaltered (and consequently imply nothing 
more than a simple “stirring” of the liquid), do not constitute excited states 
because they differ from the ground state only in the “permutation” of certain 
atoms. Motions on an atomic scale are indeed possible, but they require a minimum 
energy A for their excitation; clearly, these excitations would show up only at 
comparatively higher temperatures (T ~ A jk) and might well turn out to be 
Landau’s rotons. 

(iii) The wavefunction of the liquid, in the presence of an excitation, should be 
approximately of the form 


4 /= *£]/(/•;), 

i 


(3) 


where <J> denotes the ground-state wavefunction of the system while the summation 
of f(ri) goes over all the N coordinates r\, ..., r.y; the wavefunction T is, clearly, 
symmetric in its arguments. The exact character of the function/(r) can be 
determined by making use of a variational principle that requires the energy of the 
state T (and hence the energy associated with the excitation in question) to be a 
minimum. 

The optimal choice for f(r) turns out to be, see Problem 11.8, 

/(r) = expi(fc-r), (4) 


9 The reader interested in pursuing Feynman’s line of argument should refer to Feynman’s original papers or to a 
review of Feynman’s work on superfluidity by Mehra and Pathria (1994). 
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with the (minimized) energy value 


e(fc) = 


h 2 k 2 
2mS{k) ’ 


( 5 ) 


where S(lc) is the structure factor of the liquid, that is, the Fourier transform of the pair 
correlation function g(r ): 


S(k) = 1 + n J (g(r) - 1 )e lk r dr; (6) 

it may be recalled here that the function ng(r 2 — n) is the probability density for finding 
an atom in the neighborhood of the point r 2 when another one is known to be at the point 
n; see Section 10.7. The optimal wavefunction is, therefore, given by 

vl/ = 3>^y' fcr ;. (7) 

i 


Now the momentum associated with this excited state is hk because 

PV = ^-ihJ2^ij'l > = Kk&, (8) 

P<t> being identically equal to zero. Naturally, this would be interpreted as the momentum 
p associated with the excitation. One thus obtains, from first principles, the energy- 
momentum relationship for the elementary excitations in a Bose liquid. 

On physical grounds one can show that, for small k, the structure factor S(k) rises lin¬ 
early as hk/2mc, reaches a maximum near k — 2n/r 0 (corresponding to a maximum in 
the pair correlation function at the nearest-neighbor spacing r 0 , which for liquid He 4 is 
about 3.6 A) and thereafter decreases to approach, with minor oscillations (corresponding 
to the subsidiary maxima in the pair correlation function at the spacings of the next near¬ 
est neighbors), the limiting value 1 for large k; the limiting value 1 arises from the presence 
of a delta function in the expression for g(r) (because, as i ’2 —»■ r\, one is sure to find an 
atom there). 10 Accordingly, the energy e(k) of an elementary excitation in liquid He 4 would 
start linearly as hkc, show a “dip” at ko ~ 2A 1 and rise again to approach the eventual 
limit of h 2 k 2 /2m. u These features are shown in Figure 11.2. Clearly, Feynman’s approach 
merges both phonons and rotons into a single, unified scheme in which they represent 

10 For a microscopic study of the structure factor S(k), see Huang and Klein (1964); also Jackson and Feenberg (1962). 

11 It is natural that at some value of k < ko, the (e,fc)-curve passes through a maximum ; this happens when 
dS/dk = 2 S/k. 
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k, in A 1 


FIGURE 11.2 The energy spectrum of the elementary excitations in liquid He 4 . The upper portion shows the 
structure factor of the liquid, as derived by Henshaw (1960) from experimental data on neutron diffraction. Curve 1 
in the lower portion shows the energy-momentum relationship based on the Feynman formula (5) while curve 2 is 
based on an improved formula by Feynman and Cohen (1956). For comparison, the experimental results of Woods 
(1966) obtained directly from neutron scattering are also included. 


different parts of a common (and continuous) energy spectrum s(k), as determined by the 
structure of the liquid through the function S(k). Since no motion of a rotational character 
is involved here, the name “roton” is clearly a misnomer. 

It seems appropriate to mention here that, soon after the work of London, which advo¬ 
cated a connection between the phase transition in liquid He 4 and the phenomenon of 
Bose-Einstein condensation, Bijl (1940) investigated the mathematical structure of the 
wavefunctions appropriate to an interacting Bose gas and the excitation energy associ¬ 
ated with those wavefunctions. His picture corresponded very closely to Feynman’s and 
indeed led to the wavefunction (7). Bijl also derived an expression for s(k) that was exactly 
the same as (5). Unfortunately, he could not make much use of his results — primarily 
because he leaned too heavily on the expansion 

S(fc) = S(0) + C 2 k 2 + C 4 fc 4 + ---, (9) 

which, as we now know, represents neither phonons nor rotons. 
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11.5 States with quantized circulation 

We now proceed to examine the possibility of “organized motion” in the ground 
state of a Bose fluid. In this context, the most important concept is embodied in the cir¬ 
culation theorem of Feynman (1955), which establishes a physical basis for the existence 
of “quantized vortex motion" in the fluid. In the case of liquid helium II, this concept has 
successfully resolved some of the vital questions that baffled superfluid physicists for a 
long time. 

The ground-state wavefunction of a superfluid, composed of N bosons, may be 
denoted by a symmetric function 0(n,..., rjv); if the superfluid does not partake in any 
organized motion, then $ will be a pure real number. If, on the other hand, it possesses a 
uniform mass-motion with velocity v s , then its wavefunction would be 

vj/ _ (j> gUPs-m/h _ q >e im(ys 'E i r i )/h' 

where P s denotes the total momentum of the fluid and R its center of mass: 

P s = Nmv s ; R = N~ 1 J2 r i- (2) 

i 

The wavefunction (1) is exact if the drift velocity v s is uniform throughout the fluid. If v s 
is nonuniform, then the present wavefunction would still be good locally — in the sense 
that the phase change Af resulting from a “set of local displacements” of the atoms (over 
distances too small for velocity variations to be appreciable) would be practically the same 
as the one following from expression (1). Thus, for a given set of displacements Ar,- of 
the atoms constituting the fluid, the change in the phase of the wavefunction would very 
nearly be 

A 4>= T jt J2 (Vsi ' Ar *)< ® 

i 


where v s is now a function of r. 

The foregoing result may be used for calculating the net phase change resulting from 
a displacement of atoms along a ring, from their original positions in the ring to the 
neighboring ones, so that after displacement we obtain a configuration that is physically 
identical to the one we started with; see Figure 11.3. In view of the symmetry of the wave- 
function, the net phase change resulting from such a displacement must be an integral 
multiple of 2tt (so that the wavefunction after the displacement is identical to the one 
before the displacement): 

jJ^Xv sr Ari) = 2nn, n = 0 ,± 1 ,± 2 ,...; ( 4 ) 

the summation here goes over all the atoms constituting the ring. We note that, for the 
foregoing result to be valid, it is only the individual Ar, that have to be small, not the whole 
perimeter of the ring. Now, for a ring of a macroscopic size, one may regard the fluid as a 
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FIGURE 11.3 The wavefunction of the fluid must not change as a result of a permutation of the atoms. If all the 
atoms are displaced around a ring, as shown, the phase change must be a multiple of 2n. 


continuum; equation (4) then becomes 


v s dr=n —, n = 0,±1,±2,_ 

m 


(5) 


The quantity on the left side of this equation is, by definition, the circulation (of the flow) 
associated with the circuit of integration and is clearly quantized, the “quantum of circu¬ 
lation” being h/m. Equation (5) constitutes the circulation theorem of Feynman; it bears a 
striking resemblance to the quantum condition of Bohr, namely 


pdq = nh, 


( 6 ) 


though the region of application here is macroscopic rather than microscopic. 12 
By Stokes’ theorem, equation (5) may be written as 


/ 


(curl v s ) ■ dS = n —, n= 0,±1,±2,..., 
m 


(7) 


where S denotes the area enclosed by the circuit of integration. If this area is “simply- 
connected” and the velocity v s is continuous throughout the area, then the domain of 
integration can be shrunk in a continuous manner without limit. The integral on the 
left side is then expected to decrease continuously and finally tend to zero. The right 
side, however, cannot vary continuously. We infer that in this particular case the quan¬ 
tum number n must be zero, that is, our integral must be identically vanishing. Thus, 
“in a simply-connected region, in which the velocity field is continuous throughout, the 
condition 


curl v s = 0 (8) 

12 That the vortices in a superfluid may be quantized, the quantum of circulation being h/m, was first suggested by 
Onsager (1949) in a footnote to a paper dealing with the classical vortex theory and the theory of turbulence! 
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holds everywhere.” This is precisely the condition postulated by Landau (1941), which has 
been the cornerstone of our understanding of the hydrodynamic behavior of superfluid 
helium. 13 

Clearly, the Landau condition is only a special case of the Feynman theorem. It is quite 
possible that in a “multiply-connected” domain, which cannot be shrunk continuously to 
zero (without encountering singularities in the velocity field), the Landau condition may 
not hold everywhere. A typical example of such a domain is provided by the flow of a vortex, 
which is a planar flow with cylindrical symmetry, such that 

]£ 

v p = 0, V <P=^ ) ’ v z = 0 ’ O) 

where p is the distance measured perpendicular to the axis of symmetry while K is the 
circulation of the flow: 

® v- dr = ® v#(p d<t)) — K] (10) 


note that the circuit of integration in (10) must enclose the axis of the vortex. 

Another version of the foregoing result is 

j (cm\v) dS= J |~(/0^)j (2npdp) = K. (11) 

s s 

Now, at all p ^ 0, curl v — 0 but at p = 0, where rv/> is singular, curl v appears to be indeter¬ 
minate; it is not difficult to see that, at p — 0, curl v diverges (in such a way that the integral 
in (11) turns out to be finite). In this context, it seems worthwhile to point out that if we 
carry out the integration in (10) along a circuit that does not enclose the axis of the vortex, 
or in (11) over a region that does not include the point p — 0, the result would be identically 
zero. 

At this stage we note that the energy associated with a unit length of a classical vortex 
is given by 


SCI , l K \ 2 mn 0 K 2 , , 

L = J -{2np dp(mn 0 )}\-—\ = ^ \n{h/a). (12) 

a 

13 Drawing on the well-known analogy between the phenomena of superfluidity and superconductivity, and the 
resulting correspondence between the mechanical momentum mv s of a superfluid particle and the electromagnetic 
momentum 2eA/c of a Cooper pair of electrons, we observe that the relevant counterpart of the Landau condition, in 
superconductors, would be 

curlzl = B = 0, (8a) 

which is precisely the Meissner effect-, furthermore, the appropriate counterpart of the Feynman theorem would be 

J BdS=n^, (7a) 

s 


which leads to the "quantization of the magnetic flux," the quantum of flux being hc/2e. 
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Here, {mno) is the mass density of the fluid (which is assumed to be uniform), the upper 
limit b is related to the size of the container while the lower limit a depends on the structure 
of the vortex; in our study, a would be comparable to the interatomic separation. 

In the quantum-mechanical case we may describe our vortex through a self-consistent 
wavefunction \jr(r), which, in the case of cylindrical symmetry, see equation (9), may be 
written as 


*(r) = rf^e^fsip), 

so that 


(13) 


n(r) = \^(r)\ z = n*ff(p). (14) 

As p -> oo ,f s (p) —*■ 1, so that n* becomes the limiting particle density in the fluid in regions 
far away from the axis of the vortex. The velocity field associated with this wavefunction 
will be 


V(r) = T 

2im(xj/*\l/) 

= -V(s0) = fo,s—,oY (15) 14 

m \ mp ) 

Comparing (15) with (9), we conclude that the circulation K in the present case is sh/m ; 
by the circulation theorem, s must be an integer: 

s = 0,±1,±2,.... (16) 

Clearly, the value 0 is of no interest to us. Furthermore, the negative values of 5 differ from 
the positive ones only in the “sense of rotation’’ of the fluid. It is, therefore, sufficient to 
consider the positive values alone, namely 

5= 1,2,3,.... (17) 

The function f s (p) appearing in equation (13) may be determined with the help of a 
Schrodinger equation in which the potential term is itself V/-dependent, namely 

(-^ v2 + M°|i/d 2 )^=eVo (18) 

14 It is of interest to see that the angular momentum per particle in the fluid is given by 

1 /ft 3 \ 


this is again reminiscent of the quantum condition of Bohr. 
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where uq is given by equation (11.1.51): 


Uo = 4nah 2 /m, (19) 

a being the scattering length of the interparticle interaction operating in the fluid. The 
characteristic energy e follows from the observation that, at large distances from the axis 
of the vortex, the fluid is essentially uniform in density, with n(r) -» n*; equation (18) then 
gives 

s = UqU* — Anah 2 n*/m, (20) 


which may be compared with equation (11.2.9). Substituting (20) into (18) and remember¬ 
ing that the flow is cylindrically symmetrical, we get 


1 d 




P , UP) l- yfs(p) 


p dp 1 dp' 

Expressing p in terms of a characteristic length l, 

p = Ip' (l = {8nan*)~^i 2 }, 


+ 8nan*ff(p) — 8jt an*f s (p). 


( 21 ) 


( 22 ) 


we obtain 


d 2 f s 1 df s 

dp' 2 p' dp' 


-a)fs-fs= 0 . 


(23) 


Toward the axis of the vortex, where p 0, the very high velocity of the fluid (and the very 
large centrifugal force accompanying it) will push the particles outward, thus causing an 
enormous decrease in the density of the fluid. Consequently, the function/ s should tend to 
zero as p -»■ 0. This will make the last term in equation (23) negligible and thereby reduce 
it to the familiar Bessel’s equation. Accordingly, for small p, 


fs(P ) ~Js(p ) ~ p S } 


(24) 


J s being the ordinary Bessel function of order .s'. For p' » 1 ,f s ~ 1; then, the first two terms 
of equation (23) become negligible, with the result 

fs(p') — 1 — ^2 ■ (25) 

The full solution is obtained by integrating the equation numerically; the results so 
obtained are shown in Figure 11.4 where solutions for s = 1,2, and 3 are displayed. 

We thus find that our model of an imperfect Bose gas does allow for the presence of 
quantized vortices in the system. Not only that, we do not have to invoke here any special 
assumptions regarding the nature of the “core” of the vortex (as one has to do in the classi¬ 
cal theory); our treatment naturally leads to a continual diminution of the particle density 
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FIGURE 11.4 Solutions of equation (23) for various values of s (after Kawatra and Pathria, 1966). 


n as the axial line is approached, so that there does not exist any specific distribution of 
vorticity around this line. The distance scale, which governs the spatial variation of n, is 
provided by the parameter l of equation (22); for liquid He 4 , l ~ 1 A. 

Pitaevskii (1961), who was among the first to demonstrate the possibility of obtaining 
solutions whose natural interpretation lay in quantized vortex motion (see also Gross, 
1961; Weller, 1963), also evaluated the energy per unit length of the vortex. Employ¬ 
ing wavefunction (13), with known values of the functions f s (p), Pitaevskii obtained the 
following results for the energy per unit length of the vortex, with s = 1,2, or 3, 

71^ h 2 

^-{lln( 1.467?/Z), 4ln(0.597?/Z), 9ln(0.387?/Z)|, (26) 

4 nm' 1 

where R denotes the outer radius of the domain involved. The above results may be 
compared with the “semiclassical” ones, namely 

|^{lln(7 R/a), 4ln(7?/fl), 9ln(7?/a)}, (27) 

which follow from formula (12), with K replaced by sh/m and b by R. It is obvious 
that vortices with s> 1 would be relatively unstable because energetically it would be 
cheaper for a system to have s vortices of unit circulation rather than a single vortex of 
circulation s. 

The existence of quantized vortex lines in liquid helium II has been demonstrated con¬ 
vincingly by the ingenious experiments of Vinen (1958-1961) in which the circulation K 
around a fine wire immersed in the liquid was measured through the influence it exerts on 
the transverse vibrations of the wire. Vinen found that while vortices with unit circulation 
were exceptionally stable those with higher circulation too made an appearance. Repeat¬ 
ing Vinen’s experiment with thicker wires, Whitmore and Zimmermann (1965) were able 
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to observe stable vortices with circulation up to three quantum units. For a survey of this 
and other aspects of the superfluid behavior, see Vinen (1968) and Betts et al. (1969). Kim 
and Chan (2004) have even observed a “supersolid” phase of helium-4 at low temperatures 
that has the crystalline structure of a solid while also exhibiting superfuid-like flow. 

For the relevance of quantized vortex lines to the problem of “rotation” of the super¬ 
fluid, see Section 10.7 of the first edition of this book. 


11.6 Quantized vortex rings and the breakdown 
of superfluidity 

Feynman (1955) was the first to suggest that the formation of vortices in liquid helium II 
might provide the mechanism responsible for the breakdown of superfluidity in the liquid. 
He considered the flow of liquid helium II from an orifice of diameter D and, by tentative 
arguments, found that the velocity Vo at which the flow energy available would just be 
sufficient to create quantized vortices in the liquid is given by 

co = —f \n(D/l). (1) 

mD 

Thus, for an orifice of diameter 10 -5 cm, v 0 would be of the order of 1 m/s. 15 It is tempting 
to identify c 0 with v c , the critical velocity of superflow through the given capillary, despite 
the fact that this theoretical estimate for v 0 is an order of magnitude higher than the cor¬ 
responding experimental values of v c ; the latter, for instance, are 13 cm/s, 8 cm/s, and 
4 cm/s for capillary diameters 1.2 x 10 -5 cm, 7.9 x 10 _5 cm, and 3.9 x 10 _4 cm, respec¬ 
tively. Nevertheless, the present estimate is far more desirable than the prohibitively large 
ones obtained earlier on the basis of a possible creation of phonons or rotons in the liquid; 
see Section 7.6. Moreover, one obtains here a definitive dependence of the critical veloc¬ 
ity of superflow on the width of the capillary employed which, at least qualitatively, agrees 
with the trend seen in the experimental findings. In what follows, we propose to develop 
Feynman’s idea further along the lines suggested by the preceding section. 

So far we have been dealing with the so-called linear vortices whose velocity field pos¬ 
sesses cylindrical symmetry. More generally, however, a vortex line need not be straight — 
it may be curved and, if it does not terminate on the walls of the container or on the free 
surface of the liquid, may close on itself. We then speak of a vortex ring, which is very much 
like a smoke ring. Of course, the quantization condition (11.5.5) is as valid for a vortex ring 
as for a vortex line. However, the dynamical properties of a ring are quite different from 
those of a fine; see, for instance, Figure 11.5, which shows schematically a vortex ring in 
cross-section, the radius r of the ring being much larger than the core dimension l. The 
flow velocity v s at any point in the field is determined by a superposition of the flow velo¬ 
cities due to the various elements of the ring. It is not difficult to see that the velocity field 

15 We have taken here: / ~ 1 A, so that In (D/l) ~ 7. 
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FIGURE 11.5 Schematic illustration of a quantized vortex ring in cross-section. 


of the ring, including the ring itself, moves in a direction perpendicular to the plane of the 
ring, with a velocity 16 


v ~ h/2mr, ( 2 ) 

see equation (11.5.15), with 5= 1 and p ~ 2r. An estimate of the energy associated with the 
flow may be obtained from expression (11.5.12), withi = 2jtr,K = h/m, and b ~ r; thus 

e ~ 2n 2 h 2 riQmr 1 r\n(r/1). (3) 

Clearly, the dependence of s on r arises mainly from the factor r and only slightly from 
the factor ln(r/Z). Therefore, with good approximation, v cx e _1 , that is, a ring with larger 
energy moves slower! The reason behind this somewhat startling result is that firstly the 
larger the ring the larger the distances between the various circulation-carrying elements 
of the ring (thus reducing the velocity imparted by one element to another) and secondly 
a larger ring carries with it a larger amount of fluid (M oc r 3 ), so the total energy associated 
with the ring is also larger (essentially proportional to Mr 2 , i.e., cx r). The product vs, apart 
from the slowly varying factor ln(r/Z), is thus a constant, which is equal to n: 2 h 3 n 0 /m 2 . 

It is gratifying that vortex rings such as the ones discussed here have been observed 
and the circulation around them is found to be as close to the quantum h/m as one could 
expect under the conditions of the experiment. Figure 11.6 shows the experimental results 
of Rayfield and Reif (1964) for the velocity-energy relationship of free-moving, charge¬ 
carrying vortex rings created in liquid helium II by suitably accelerated helium ions. Vortex 
rings carrying positive as well as negative charge were observed; dynamically, however, 
they behaved alike, as one indeed expects because both the velocity and the energy asso¬ 
ciated with a vortex ring are determined by the properties of a large amount of fluid carried 

16 This result would be exact if we had a pair of oppositely directed linear vortices, with the same cross-section as 
shown in Figure 11.5. In the case of a ring, the velocity would be somewhat larger. 
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FIGURE 11.6 The velocity-energy relationship of the vortex rings formed in liquid helium II (after Rayfield and Reif, 
1964). The points indicate the experimental data, while the curve represents the theoretical relationship based on 
the "quantum of circulation" h/m. 


along with the ring rather than by the small charge coupled to it. Fitting experimental 
results with the notion of the vortex rings, Rayfield and Reif concluded that their rings 
carried a circulation of (1.00 ± 0.03) x 10 -3 cm 2 /s, which is close to the Onsager-Feynman 
unit h/m(— 0.997 x 10 _3 cm 2 /s); moreover, these rings seemed to have a core radius of 
about 1.2A, which is comparable with the characteristic parameter l of the fluid. 

We shall now show that the dynamics of the quantized vortex rings is such that their 
creation in liquid helium II does provide a mechanism for the breakdown of superfluidity. 
To see this, it is simplest to consider the case of a superfluid flowing through a capillary of 
radius R. As the velocity of flow increases and approaches the critical value v c , quantized 
vortex rings begin to form and energy dissipation sets in, which in turn brings about the 
rupture of the superflow. By symmetry, the rings will be so formed that their central plane 
will be perpendicular to the axis of the capillary and they will be moving in the direction 
of the main flow. Now, by the Landau criterion (7.6.24), the critical velocity of superflow is 
directly determined by the energy spectrum of the excitations created: 


Vc = (e/P)min- (4) 

We, therefore, require an expression for the momentum p of the vortex ring. In analogy 
with the classical vortex ring, we may take 

p = 2n 2 hnoT 2 , (5) 

which seems satisfactory because (i) it conforms to the general result: v — (de/dp), though 
only to a first approximation, and (ii) it leads to the (approximate) dispersion relation: 
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s cx p 1 ' 2 , which has been separately verified by Rayheld and Reif by subjecting their rings 
to a transverse electric field. Substituting (3) and (5) into (4), we obtain 


v c 


— In (r/l) 
mr 


min 


( 6 ) 


Now, since the /'-dependence of the quantity e/p arises mainly from the factor 1/r, the 
minimum in (6) will be obtained when r has its largest value, namely R, the radius of the 
capillary. We thus obtain 


Vc 


h 

mR 


Inf R/l), 


(7) 


which is very much the same as the original estimate of Feynman — with D replaced by R. 
Naturally, then, the numerical values of v c obtained from the new expression (7) continue 
to be significantly larger than the corresponding experimental values; however, the theory 
is now much better founded. 

Fetter (1963) was the first to account for the fact that, as the radius r of the ring 
approaches the radius R of the capillary, the influence of the “image vortex” becomes 
important. The energy of the flow now falls below the asymptotic value given by (3) by a 
factor of 10 or so which, in turn, reduces the critical velocity by a similar factor. The actual 
result obtained by Fetter was 


ll h 

V c — - 

24 mR 


0.46- 


mR 


( 8 ) 


Kawatra and Pathria (1966) extended Fetter’s calculation by taking into account the 
boundary effects arising explicitly from the walls of the capillary as well as the ones aris¬ 
ing implicitly from the “image vortex"; moreover, in the computation of s, they employed 
actual wavefunctions, obtained by solving equation (11.5.23), rather than the analytical 
approximation employed by Fetter. They obtained 

h 

v c — 0.59-, (9) 


which is about 30 percent higher than Fetter’s value; for comments regarding the “most 
favorable” location for the formation of the vortex ring in the capillary, see the original 
reference of Kowatra and Pathria (1966). 


11.7 Low-lying states of an imperfect Fermi gas 

The Hamiltonian of the quantized field for spin-half fermions (a = or — \) is given by 
equation (11.1.48), namely 

^ = E Jm a P° a P° + 2 


ys „Pl a vP2°2 


Pi'll •P2 n 2 a p' ] „! l a f/ 2 a^ a P2 a 2 a Plcri ’ 


( 1 ) 
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where the matrix elements u are related to the scattering length a of the two-body inter¬ 
action; the summation in the second part of this expression goes only over those states (of 
the two particles) that conform to the principles of momentum and spin conservation. As 
in the Bose case, the matrix elements u in the second sum may be approximated by their 
values at p = 0, that is, 


“Pl°'l>P2 cr 2 — 


Ooq ,0 o' n 

7/ 1 z 

U Qo\ ,0cr 2 ‘ 


( 2 ) 


Then, in view of the antisymmetric character of the product a l>] „ ] a V2 „ 2 , see equa¬ 
tion (11.1.24c), all those terms of the second sum in (1) that contain identical indices 
and <72 vanish on summation over p 1 and p 2 . Similarly, all those terms that contain iden¬ 
tical indices <rj and <7 2 vanish on summation over p\ and p ' 2 . 1 7 Thus, for a given set of 
values of the particle momenta, the only choices for the spin components remaining in 
the sum are 


(i) 

<71 - +\, 

<72 

1. 

— 2’ 

_/ ,1 
<7f — + 2 I 

_/ 1 

a 2~ 2 

(ii) 

CT 1 = +2’ 

<72 

1. 

— 2’ 

_/ 1 
<7i — 2 , 

ct 2 - +2 

(iii) 

„ 1 
<7l — 2’ 

<72 

= +z : 

_/ 1 
CT 1 — 2’ 

CT 2 — + 2 

(iv) 

CT 1 ~ ~ 2’ 

<72 

= +i; 

_/ ,1 
<7f — + 2 I 

/ 1 

a 2~ 2 


It is not difficult to see that the contribution arising from choice (i) will be identi¬ 
cally equal to the one arising from choice (iii), while the contribution arising from choice 
(ii) will be identically equal to the one arising from choice (iv). We may, therefore, write 


h=Y — 

^ 2m 

p,a 


rifin Clftr, + — 2_, 


Cl , Cl , 
Pl+ p 2 - 


(3) 


where 


Uq 

V 


(u 


0+,0- 

0+,0- 


— u 


0 -, 0 + 

0+,0- 


)■ 


(4) 


while the indices + and — denote the spin states a — and a — respectively; the 
summation in the second part of (3) now goes over all momenta that conform to the 
conservation law 


p'l +P 2 = Pl +P2- (5) 

To evaluate the eigenvalues of Hamiltonian (3), we shall employ the techniques of the 
perturbation theory. 

17 PhysicalIy, this means that in the limiting case of slow collisions only particles with opposite spins interact with one 
another. 
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First of all, we note that the main term in the expression for H is already diagonal, and 
its eigenvalues are 


p,a 




( 6 ) 


where n pa is the occupation number of the single-particle state (p,o); its mean value, in 
equilibrium, is given by the Fermi distribution function 


Ylpo - 


z Q 1 exp(p 2 /2mkT) + 1 


(7) 


The sum in (6) maybe replaced by an integral, with the result (see Section 8.1, withg = 2) 

3 kT. 


E (0) = V^-f 5 / 2 (z 0 ), 

where X is the mean thermal wavelength of the particles, 

X = h/(2nmkT )b 2 , 

while /vCzo) is the Fermi-Dirac function 


_ I f x" l dx ^ / i z n 

n u i=i 


( 8 ) 


(9) 


( 10 ) 


the ideal-gas fugacity zq is determined by the total number of particles in the system: 

N = J2 n P<y = F^/ 3 / 2 (2o)- 


p,a 


(ID 


The first-order correction to the energy of the system is given by the diagonal elements 
of the interaction term, namely the ones for which p\ — p l and p' 2 = p 2 , thus 


E m = “o y n n = l — N + N -, 

y P 1 + P 2 ~ y ’ 

PvP 2 


( 12 ) 


where N + (N ) denotes the total number of particles with spin up (down). Substituting the 
equilibrium values N+ = N~ = \N, we obtain 




(13) 


Substituting u 0 ~ An ah 2 /m, see equation (11.1.51), we obtain to first order in a 

ni nah 2 N 2kT / a\ ,9 

E “ =v^r N - l 'ir(dlfe <z » ) l ■ |14 > 
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The second-order correction to the energy of the system can be obtained with the help of 
the formula 


7(2) 


= £. 

m^n 


I Vnm\ 
Erj E n 


(15) 


where the indices n and m pertain to the unperturbed states of the system. A simple 
calculation yields: 


E ( 2 ) = 2 “cl y 

v 2 2 — 1 

Pl'P2 »Pj 


n Pl+ n P2 - (1 -n, 


(t- Vi+ ) (l -v 2 ) 


pf+ pI - Pi - p 2 )/ 2m 


(16) 


where the summation goes over all p v p 2 , and p\ (the value of p' 2 being fixed by the 
requirement of momentum conservation); it is understood that we do not include in the 
sum (16) any terms for which p 2 + p 2 = pf + pf. It will be noted that the numerator of 
the summand in (16) is closely related to the fact that the squared matrix element for the 
transition ip v p 2 ) -»• (p\<p’ 2 ) is directly proportional to the probability that “the states p 1 
and p 2 are occupied and at the same time the states p\ and p' 2 are unoccupied.” 

Now, expression (16) does not in itself exhaust terms of second order in a. A contribution 
of the same order of magnitude arises from expression (12) if for uq we employ an expres¬ 
sion more accurate than the one just employed. The desired expression can be obtained 
in the same manner as in the Bose case; check the steps leading to equations (11.2.5) and 
(11.2.6). In the present case, we obtain 


Anah 2 

mV 


u 0 . 0 u o * 

V + V 2 ^ ( \ 

Pi.P2.Pi \P 2 i+pl~ pf - p 2 /2 m 


from which it follows that 


wo - 


An ah 2 


^ 87t ali, 2 


mV 


Pi.P2Pj (pi +p\~ Pj 2 - Pf^/2r 


(17) 


Substituting (17) into (12), we obtain, apart from the first-order term already given in (14), 
a second order term, namely 



(18) 
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For a comparable term given in (16), the approximation uq — An ah 2 /m is sufficient, with 
the result 


io\ l Anah 2 

3X Z) = 2 - 




mV 


E 


n. 


Pl+"P2 


«„,(l-n pi+ )(l-« p ,) 


P1.P2.Pi (F\+P 2 - pf - pf j / 2 m 


Combining (18) and (19), we obtain 18 


£ 2 = 4 1 ) + 4 2) = - 2 (^~ 


E 


ipi+ n p 2 - (« p ' + + n p ^ 


) 


P 1 .P 2 .P 1 (pl +P2~ pf - p%j /2m 


(19) 


( 20 ) 


To evaluate the sum in (20), we prefer to write it as a symmetrical summation over 
the four momenta p ] ,p 2 ,p ] , and p' 2 by introducing a Kronecker delta to take care of the 
momentum conservation; thus 


An ah 2 \ 2 ^ n Pi+ n P 2 -y 1 p f 1 + + n p l 2 -) S Pi+p 2 ,p'i+P , 2 

mV ) ^ T~2 9 ^ E ' 

' P 1 .P 2 .Pi.P 2 I p\ + P2~Pi -Pi I/2m 

It is obvious that the two parts of the sum (21), one arising from the factor fy + and the 
other from the factor n p ;_, would give identical results on summation. We may, therefore, 
write 



E 2 = -4 


Anah 2 
mV 


E 


n Pl + n P2 ~ M pi + “Vi +p 2 .P) +P' 2 


P 1 .P 2 .Pi.Pi (p\ + P 2 2 - pf - pf j/2m 


( 22 ) 


The sum in (22) can be evaluated by following a procedure due to Huang, Yang, and 
Luttinger (1957), with the result 19 


E 2 = V 


8 kT 

IF 



F(z 0 ), 


(23) 


where 


F(z 0 ) = 


r " 1 V(rst)(r+s)(r+t)' 


(24) 


18 We have omitted here terms containing a “product of four n’s" for the following reason: in view of the fact that 
the numerator of such terms would be symmetric and the denominator antisymmetric with respect to the exchange 
operation (Pi,p 2 ) ■*+ (pi ,p 2 ), their sum over the variables P 1 .P 2 .P 1 (and p ' 2 ) would vanish identically. 

19 For a direct evaluation of the sum (22), in the limit T -»• 0, see Abrikosov and Khalatnikov (1957). See also 
Problem 11.12. 
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Combining (8), (14), and (23), we obtain to second order in the scattering length a 


E= V k 4 
X 3 


2a .„ . 2 8a 2 


3/5/2(20) + —1/3/2(20)) + -^2~F(Zo) 


(25) 


where zq is determined by (11) . 

It is now straightforward to obtain the ground-state energy of the imperfect Fermi gas 
(zo -»• 00); we have simply to know the asymptotic behavior of the functions involved. For 
the functions f v (zo), we have from the Sommerfeld lemma (see Appendix E) 


/„(zd)«(lnzo)7r(v + l), 


(26) 


so that 


/5/2(2o) ~ ^ 1/2 (lnz 0 ) 5/2 ; /a /2 ( 20 ) ~ ^J2 (ln2 0 ) 3/2 - (27) 

Equation (11) then gives 


so that 



8 

37r!/2^3 


(lnz 0 ) 3/2 , 


lnz 0 : 


/ 3 ^ 1 / 2„\ 2/3 


(28) 


(29) 


The asymptotic behaviour of F(zq) is given by 

16(11 -2ln2) 


F(z 0 ) ■ 


105tt 3 / 2 


(lnz 0 ) 7/2 ; 


(30) 


see Problem 11.12. Substituting (27) and (30) into (25), and making use of relation (29), we 
finally obtain 


9 9/0 7 r ati I 6 , (3 \ 

(3 n 2 n) 2/3 + - n\ 1+ —(ll-21n2) - 

m I 35 \ n / 


E 0 _ 3 h 2 
N 10 m 


The ground-state pressure of the gas is then given by 

2 d(E 0 /N) 


1/3 


d/3, 


(31) 


Po = n 


tin 


1 h l 


nah z 


=- (3n ri) ' n + 

5 m m 


8 


\ 1+ — (11 -21n2) n L/3 a\. 


3\ 1/3 


(32) 
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We may also calculate the velocity of sound, which directly involves the compressibility of 
the system, with the result 



3 (mri) 



(33) 


The leading terms of the foregoing expressions represent the ground-state results for 
an ideal Fermi gas, while the remaining terms represent corrections arising from the 
interparticle interactions. 

The result embodied in equation (31) was first obtained by Huang and Yang (1957) by 
the method of pseudopotentials; Martin and De Dominicis (1957) were the first to attempt 
an estimate of the third-order correction. 20 Lee and Yang (1957) obtained (31) on the basis 
of the binary collision method; for the details of their calculation, see Lee and Yang (1959b, 
1960a). The same result was derived somewhat later by Galitskii (1958) who employed the 
method of Green’s functions. 


11.8 Energy spectrum of a Fermi liquid: Landau's 
phenomenological theory 


In Section 11.4 we discussed the main features of the energy spectrum of a Bose liquid; 
such a spectrum is generally referred to as a Bose type spectrum. A liquid consisting of 
spin-half fermions, such as liquid He 3 , is expected to have a different kind of spectrum 
which, by contrast, may be called a Fermi type spectrum. 

Right away we should emphasize that a liquid consisting of fermions may not neces¬ 
sarily possess a spectrum of the Fermi type; the spectrum actually possessed by such a 
liquid depends crucially on the nature of the interparticle interactions operating in the 
liquid. The discussion here assumes that the interactions are strictly repulsive so that the 
fermions have no opportunity to form bosonic pairs. In the present section, we propose to 
discuss the main features of a spectrum which is characteristically of the Fermi type. The 
effects of attractive interactions will be discussed in Section 11.9 

According to Landau (1956), whose work provides the basic framework for our discus¬ 
sion, the Fermi type spectrum of a quantum liquid is constructed in analogy with the 
spectrum of an ideal Fermi gas. As is well-known, the ground state of the ideal Fermi 
gas corresponds to a “complete filling up of the single-particle states with p<Pf and a 
complete absence of particles in the states with p > pp"-, the excitation of the system corre¬ 
sponds to a transition of one or more particles from the occupied states to the unoccupied 
states. The limiting momentum pp is related to the particle density in the system and, for 

20 The third-order correction has also been discussed by Mohling (1961). 

21 For a microscopic theory of a Fermi liquid, see Nozieres (1964); see also Tuttle and Mohling (1966). 
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spin-half particles, is given by 

p F = h( 3 n 2 N/V) 1/3 . ( 1 ) 

In a liquid, we cannot speak of quantum states of individual particles. However, as a 
basis for constructing the desired spectrum, we may assume that, as interparticle inter¬ 
actions are gradually “switched on” and a transition made from the gaseous to the liquid 
state, the ordering of the energy levels (in the momentum space) remains unchanged. Of 
course, in this ordering, the role of the gas particles is passed on to the “elementary exci¬ 
tations” of the liquid (also referred to as “quasiparticles”), whose number coincides with 
the number of particles in the liquid and which also obey Fermi statistics. Each “quasipar¬ 
ticle” possesses a definite momentum p, so we can speak of a distribution function n(p) 
such that 

J n(p)dr = N/V, (2) 

where dr = 2 d 3 p/h 3 . We then expect that the specification of the function nip) uniquely 
determines the total energy £ of the liquid. Of course, E will not be given by a simple sum of 
the energies sip) of the quasiparticles; it will rather be afunctional of the distribution func¬ 
tion nip). In other words, the energy £ will not reduce to the simple integral / e(p)n(p)Vdr , 
though in the first approximation a variation in its value may be written as 

SE = V J eip)Sn(p)dr, (3) 

where Snip ) is an assumed variation in the distribution function of the “quasiparticles.” 
The reason £ does not reduce to an integral of the quantity e(p)n(p) is related to the fact 
that the quantity sip) is itself a functional of the distribution function. If the initial distri¬ 
bution function is a step function (which corresponds to the ground state of the system), 
then the variation in sip) due to a small deviation of the distribution function from the 
step function (which implies only low-lying excited states of the system) would be given 
by a linear functional relationship: 

Seip) = J f(p,p')Snip')dT'. ( 4 ) 

Thus, the quantities s(p) and fip, p') are the first and second functional derivatives of £ 
with respect to nip). Inserting spin dependence, we may now write 

SE= 'Y^sip,a)&nip,a) + — fip,o\p',o')&nip,o)&nipl,o'), (5) 

P’ a p,cr,p',a' 

where Sn are small variations in the distribution function nip) from the step function (that 
characterizes the ground state of the system); it is obvious that these variations will be sig¬ 
nificant only in the vicinity of the limiting momentum p F , which continues to be given 
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by equation (1). It is thus understood that the quantity e{p,o) in (5) corresponds to the 
distribution function n(p, a ) being infinitesimally close to the step function (of the ground 
state). One may also note that the function f{p,o\p', a'), being a second functional deriva¬ 
tive of E, must be symmetric in its arguments; often, it is of the form a + bs\ ■ S 2 , where 
the coefficients a and b depend only on the angle between the momenta p and p'. 22 The 
function/ plays a central role in the theory of the Fermi liquid; for an ideal gas, / vanishes. 

To discover the formal dependence of the distribution function n(p) on the energy e(p), 
we note that, in view of the one-to-one correspondence between the energy levels of the 
liquid and of the ideal gas, the number of micro states (and hence the entropy) of the liquid 
is given by the same expression as for the ideal gas; see equation (6.1.15), with all g* = 1 and 
a = +1, or Problem 6.1: 


S 

Tc 


— ^{«ln«+ (1 — «)ln(l — ri)} 
P 



n + (1 — «)ln(l — n)}dx. 


( 6 ) 


Maximizing this expression, under the constraints SE = 0 and SN — 0, we obtain for the 
equilibrium distribution function 


exp{(s — u)/kT} +1 

It should be noted here that, despite its formal similarity with the standard expression 
for the Fermi-Dirac distribution function, formula (7) is different insofar as the quantity 
s appearing here is itself a function of n; consequently, this formula gives only an implicit, 
and probably a very complicated, expression for the function n. 

A word may now be said about the quantity e appearing in equation (5) . Since this e cor¬ 
responds to the limiting case of n being a step function, it is expected to be a well-defined 
function of p. Equation (7) then reduces to the usual Fermi-Dirac distribution function, 
which is indeed an explicit function of e. It is not difficult to see that this reduction remains 
valid so long as expression (5) is valid, that is, so long as the variations Sn are small, which 
in turn means that T <$: 7/. As mentioned earlier, the variation Sn will be significant only 
in the vicinity of the Fermi momentum pf, accordingly, we will not have much to do with 
the function s(p) except when p ~ p F . We may, therefore, write 


s(p~p F ) = s F + I {p-p F )+---~e F +u F (p-p F ), 

d P>P=Pr 


( 8 ) 


where u F denotes the “velocity” of the quasiparticles at the Fermi surface. In the case of an 
ideal gas (e = p 2 /2m), u F = p F /m. By analogy, we define a parameter m* such that 


* Pf 

m = — = 


Pf 


Up ( ds/dp)p=p F 


(9) 


22 Of course, if the functions involved here are spin-dependent, then the factor 2 in the element dx (as well as in dx') 
must be replaced by a summation over the spin variable(s). 
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and call it the effective mass of the quasiparticle with momentum p F (or with p — p F ). 

Another way of looking at the parameter m* is due to Brueckner and Gammel (1958), 
who wrote 


s(p ~ p F ) = 


? ? 

rr p 

- \-V(p) = — -h const.; 

2m 2m* 


( 10 ) 


the philosophy behind this expression is that “for quasiparticles with p — p F , the modifi¬ 
cation, V(p), brought into the quantity s(p) by the presence of inter-particle interactions 
in the liquid may be represented by a constant term while the kinetic energy, p 2 /2m, is 
modified so as to replace the particle mass m by an effective, quasiparticle mass m*”; in 
other words, we adopt a mean field point of view. Differentiating (10) with respect to p and 
setting p — pf, we obtain 


l l | l / dV(p) \ 
m* m p F \ dp ) p=pp 


( 11 ) 


The quantity m*, in particular, determines the low-temperature specific heat of the 
Fermi liquid. We can readily see that, for T <sc 7>, the ratio of the specific heat of a Fermi 
liquid to that of an ideal Fermi gas is precisely equal to the ratio m*/m: 


(Cy )real _ m* 
G-V) ideal rn 


This follows from the fact that (i) expression (6) for the entropy S, in terms of the distri¬ 
bution function n, is the same for the liquid as for the gas, (ii) the same is true of relation 
(7) between 7z and s, and (iii) for the evaluation of the integral in (6) at low temperatures 
only momenta close to p F are important. Consequently, the result stated in Problem 8.13, 
namely 


Cy yfc 2 7Xe F ), (13) 

continues to hold — with the sole difference that in the expression for the density of states 
a(eF), in the vicinity of the Fermi surface, the particle mass m gets replaced by the effective 
mass m*; see equation (8.1.21). 

We now proceed to establish a relationship between the parameters m and m* in terms 
of the characteristic function/. In doing so, we neglect the spin-dependence of/, if any; 
the necessary modification can be introduced without any difficulty. The guiding principle 
here is that, in the absence of external forces, the momentum density of the liquid must 
be equal to the density of mass transfer. The former is given by / pndr, while the latter is 
given by m f (3e/3p)ndr, (de/dp) being the “velocity” of the quasiparticle with momentum 
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p and energy e . 23 Thus 


/ pndr = m f — ndr. (14) 

J dp 

Varying the distribution function by Sn and making use of equation (4), we obtain 

JpSndr = m j ^Sndr + m jj j 'dr'j ndr 

/ $ £ C C 0 yi! 

—Sndr — m f(p,p') — Sndrdr') (15) 

dp JJ dp' 

in obtaining the last expression, we have interchanged the variables p and p and have 
also carried out an integration by parts. In view of the arbitrariness of the variation Sn, 
equation (15) requires that 


P 

m 


de 

dp 



,dn' , 
p) - dr . 

H dp' 


(16) 


We apply this result to quasiparticles with momenta close to pf, at the same time, we 
replace the distribution function n' by a “step” function, whereby 


dn' 
dp 7 


P , 

= - H l S(f/-p H ). 
P 


This enables us to carry out integration over the magnitude p' of the momentum, so that 

, dn' 2p' 2 dp'do)' 




h 3 


2 PF 
" h 3 


J f(9)p' F da)', 


(17) 


dw' being the element of a solid angle; note that we have contracted the arguments of 
the function / because in simple situations it depends only on the angle between the 
two momenta. Inserting (17) into (16), with p — p F , making a scalar product with p F and 
dividing by p F , we obtain the desired result 


- = —+^-A[f(9)cosed(o'. (18) 

m m* 2h 3 

If the function / depends on the spins Si and .s' 2 of the particles involved, then the factor 4 
in front of the integral will have to be replaced by a summation over the spin variables. 

23 Since the total number of quasiparticles in the liquid is the same as the total number of real particles, to obtain the 
net transport of mass by the quasiparticles one has to multiply their number by the mass m of the real particle. 
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We now derive a formula for the velocity of sound at absolute zero. From first principles, 
we have 24 

2 dP 0 V 2 / dP(j\ 

<i! d{mN/V) mN\dVJ N ' 

In the present context, it is preferable to have an expression in terms of the chemical poten¬ 
tial of the liquid. This can be obtained by making use of the formula Ndp 0 — VdP 0> see 
Problem 1.16, from which it follows that 25 

/3 mo \ = _V(dpo\ 

\8NJ v N\dV ) N N 2 \ dV ) N 


and hence 



(19) 


Now, //o = e(pF ) = £f'< therefore, the change Spo arising from a change SN in the total 
number of particles in the system is given by 


8po = ^Sp F + [ f(p P ,p')8n'dr'. (20) 

oPf J 

The first part in (20) arises from the fact that a change in the total number of particles in 
the system inevitably alters the value of the limiting momentum pp ; see equation (1), from 
which (for constant V ) 


Spp/pF = ]^8N/N 


and hence 


dsp Pp 8N 

dp F P 3 in* N 


( 21 ) 


24 At T = 0,S= 0; so there is no need to distinguish between the isothermal and adiabatic compressibilities of the 
liquid. 

25 Since no is an intensive quantity and, therefore, it depends on N and V only through the ratio N/V, we can write: 
p 0 = fi 0 (N/V). Consequently, 


dp 0 \ ,(d(N/V)\ 
dNjy ll °{ 3 N ) v 


— Fo 


1 

V 


and 



= -Mo 


N 

W 


( 3/r-o \ 


/ 3/ro \ 


Hence 


V 
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The second part arises from equation (4). It will be noted that the variation Sn' appearing 
in the integral of equation (20) is significant only for p' — pp) we may, therefore, write 



p')Sn'dr' 



( 22 ) 


Substituting (21) and (22) into (20), we obtain 

(w) v =SN + ^ff (mJ ' 

Making use of equations (18) and (1), we finally obtain 


(23) 


N fdpo\ Pp 


Cn = . 

0 m \ dN 


P F 

3 m 2 ' 6m/! 3 


■4 fnexi — cos d)dcL>'. 


(24) 


Once again, if the function/ depends on the spins of the particles, then the factor 4 in front 
of the integral will have to be replaced by a summation over the spin variables. 

For illustration, we shall apply this theory to the imperfect Fermi gas studied in 
Section 11.7. To calculate f(p,a m ,p’,o'), we have to differentiate twice the sum of expres¬ 
sion (11.7.12), with uq = 4itah 2 /m, and expression (11.7.22) with respect to the distribution 
function n(p,a ) and then substitute p — p' = pp. Performing the desired calculation, then 
changing summations into integrations and carrying out integrations by simple means, we 
find that the function/ is spin-dependent — the spin-dependent term being in the nature 
of an exchange term, proportional to S] sz- The complete result, according to Abrikosov 
and Khalatnikov (1957), is 


f(p,o\pt ,o r ) =A(G) + B(9)si s 2 , 


(25) 


where 


A(9) = 


2nah 2 
m 


1 + 2 a 


31V\ 1/3 ( o cose l + sin(0/2) 1 
nV ) l + 2sin(e/2) “ 1 - sin(6>/2) j 


and 


B(9) = - 


87 rah 2 
m 


1 + 2 a 


3 N \ 1/3 
xV) 



ln 1 + sin(6»/2) 1 
ln i-sin(e/2)J 


a being the scattering length of the two-body potential and 6 the angle between the 
momentum vectors p F and p' F . Substituting (25) into formulae (18) and (24), in which the 
factor 4 is now supposed to be replaced by a summation over the spin variables, we find 
that while the spin-dependent term B(d)Si ■ S 2 does not make any contribution toward the 
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final results, the spin-independent term A(0) leads to 26 


t _ t 8 
m* m 15 m 



a‘ 


2 


( 26 ) 


and 


0 3 m 2 m 2 V 


.2 _ Pf 2nah 2 N 


4 

1+ —(11 - 21n2) 



(27) 


the latter result is identical to expression (11.7.33) derived in the preceding section. Pro¬ 
ceeding backward, one can obtain from equation (27) corresponding expressions for the 
ground-state pressure Pq and the ground-state energy To, namely equations (11.7.32) and 
(11.7.31), as well as the ground-state chemical potential po, as quoted in Problem 11.15. 


11.9 Condensation in Fermi systems 


The discussion of the T = 0 Fermi liquid in Sections 11.7 and 11.8 applies when the inter¬ 
actions between the fermions are strictly repulsive. The resulting Fermi liquid has a ground 
state and quasiparticle excitations that are qualitatively similar to the ideal Fermi gas. 
However, for fermions with attractive interactions, no matter how weak, the degenerate 
Fermi gas is unstable due to the formation of bosonic pairs. This leads to a number of 
important phenomena including superconductivity in metals, superfluidity in 3 He, and 
condensation in ultracold Fermi gases. In low-temperature superconductors, screening 
and the electron-phonon interaction result in a retarded attraction between quasiparti¬ 
cles on opposite sides of the Fermi surface. The formation of these so-called Cooper pairs 
leads to the creation of a superconducting state with critical temperature 


1 


kT c mhiDD exp 


( 1 ) 


m F )\uo\ 


where AT(cf) is the density of states per spin configuration at the Fermi surface, uq is 
the weak attractive coupling between electrons, and hcojj is the Debye energy discussed 
in Section 7.4 since the coupling is due to the acoustic phonons. As can be seen from 
equation (1), the phase transition temperature is nonperturbative in Uq. A complete treat¬ 
ment of superconductivity is far beyond the scope of this section, so we refer the reader 
to the original papers by Cooper (1956) and Bardeen, Cooper, and Schrieffer (1957) and 
the texts on superconductivity by Tilley and Tilley (1990) and Tinkham (1996). The case of 
superfluidity in 3 He is surveyed by Vollhardt and Wolfle (1990). 

Bosonic condensation has also recently been observed in trapped ultracold atomic 
Fermi gases. The sign and size of the atomic interactions in ultracold gases can be tuned 

26 In a dense system, such as liquid He 3 , the ratio m*/m would be significantly larger than unity. The experimen¬ 
tal work of Roberts and Sydoriak (1955), on the specific heat of liquid He 3 , and the theoretical work of Brueckner and 
Gammel (1958), on the thermodynamics of a dense Fermi gas, suggest that the ratio (m*/m ) ne 3 ~ 1.85. 
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with a magnetic field near Feshbach resonance allowing unprecedented experimental 
control of interactions. In particular, experimenters can create a low-lying molecular 
bound state or a weakly attractive interaction without allowing a molecular bound state 
to form. If interaction between pairs of fermions allows the formation of bound bosonic 
molecules, the ground state of a degenerate Fermi gas will be destablized since molecules 
will form and, if the density of the bosonic molecules is large enough, they will Bose- 
condense — see Greiner, Regal, and Jin (2003); Jochim et al. (2003); and Zwierlein et al. 
(2003). 

For weakly attractive interactions, the fermionic system condenses into a BCS-like state 
and provides an excellent experimental environment for testing theoretical predictions 
due to the well-understood nature and experimental control of the atomic interactions. 
Theory predicts a smooth crossover from BCS to Bose-Einstein condensation (BEC) 
behavior as the magnitude of the attractive interaction parameter uq is varied from values 
small to large. BCS theory describes the behavior for weak coupling. For broad Fesh¬ 
bach resonances of trapped fermions, the most common experimental situation, the BCS 
critical temperature is given by 


kT c 
ha> o 


g F 
haiQ 


exp 


2 kp\a\ 


-N 1 ' 3 


exp 




( 2 ) 


where &>o = ^ ?> is the average oscillation frequency of atoms in the trap, a osc — 

V fr/mwo , and kp = V2 mep/h is the Fermi wavevector; see Pitaevskii and Stringari (2003), 
Leggett (2006), and Pethick and Smith (2008). For large negative scattering lengths, the 
transition temperature smoothly crosses over to the BEC limit with noninteracting Bose- 
condensation temperature, see equation (7.2.6), 


kT c / N \ 1/3 
h^o^\2S(3)) 


(3) 


since the number of Cooper pairs is N/2. The ratio of the transition temperature in the 
BEC limit and the Fermi temperature from equation (8.4.3) is 


fF 


i y / 3 

12f (3)/ 


0.41. 


(4) 


Mean-field analysis of the broad resonance limit (Leggett, 2006) and analytical analysis of 
the narrow resonance limit (Gurarie and Radzihovsky, 2007) both indicate that the phase 
transition temperature has a maximum at intermediate coupling. Figure 11.7 is a sketch of 
the critical temperature as a function of the coupling parameter uq. 

Experimental observations of condensation in a degenerate Fermi gas in the BEC-BCS 
crossover region by Regal, Greiner, and Jin (2004) are shown in Figure 11.8. They used a 
Feshbach resonance to tune the scattering length of 40 K into an attractive range (a < 0) that 
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Fermi liquid 


^0 


BCS limit 


FIGURE 11.7 Sketch of the BEC-BCS phase diagram on the BCS side of the Feshbach resonance for ultracold 
fermions in an atomic trap. The scattering length a and coupling Uq = 4 nh 2 a/m can be tuned from positive values 
to negative with the help of a magnetic field. Positive (repulsive) couplings result in a Fermi liquid. Negative 
(attractive) couplings result in a BCS condensation at low temperatures. The nature of the condensed phase varies 
smoothly from BCS behavior for small negative coupling to Bose-Einstein behavior for large negative coupling. 
The phase transition temperature has a maximum at intermediate coupling. 



FIGURE 11.8 Time-of-flight images showing condensation of fermions in an ultracold atomic gas. The images show 
the quantum mechanical projection of the fermionic system onto a molecular gas and are shown for three values of 
the magnetic field on the BCS side of the Feshbach resonance for an ultracold trapped gas of 40 K. The temperature 
of the Fermi gas is (fcr/cp) 0.07. The condensed fraction varies from about 1 to 10 percent of the original cold 
fermions in the trap; see Regal, Greiner, and Jin (2004). Figure courtesy of NIST/JILA/University of Colorado. 

does not allow a two-particle molecular bound state and observed the fermions condens¬ 
ing into a BCS-like macroscopic quantum state. They explored the BEC-BCS crossover 
behavior by tuning \ a\ from small values to large. 


Problems 


11.1. (a) Show that, for bosons as well as fermions, 



where H is the Hamiltonian operator defined by equation (11.1.4). 
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(b) Making use of the foregoing result, show that the equation 


— — (0\f,(r l )...f(r N )H\4> NE ) =E— — (0\f(ri).. .if(r N )\'ii NE ) 

VN\ Vn\ 


= Ety NE (.ri,...r N ) 


is equivalent to the Schrodinger equation (11.1.15). 

11.2. The grand partition function of a gaseous system composed of mutually interacting bosons is 
given by 


In a 


PV 

kT 


1 / 

I? 


g 5 /2(Z)-2{g 3 /2(Z)} 2 ” +0 



Study the analytic behavior of this expression near z= 1 and show that the system exhibits the 
phenomenon of Bose-Einstein condensation when its fugacity assumes the critical value 


zc=i+ 4? G) 



Further show that the pressure of the gas at the critical point is given by (Lee and Yang 1958, 
1960b) 


Pc 

kT c 


1 

4 




compare these results to equations (11.2.16) through (11.2.18). 

11.3. For the imperfect Bose gas studied in Section 11.2, calculate the specific heat Cy near absolute 
zero and show that, as T 0, the specific heat vanishes in a manner characteristic of a system 
with an “energy gap" A = 47rah 2 n/m. 

11.4. (a) Show that, to first order in the scattering length a, the discontinuity in the specific heat Cy of 

an imperfect Bose gas at the transition temperature T c is given by 

, 9 a 

(Cv)t=t c - ~(Cv)t=t c + = Elk—- f (3/2), 

ZA C 

while the discontinuity in the bulk modulus K is given by 


( K)t=t c _ - ( K)t=t c + 


4jzah 2 

mvf 


(b) Examine the discontinuities in the quantities ( d 2 P/dT 2 ) v and (9 2 /r/3r 2 )„ as well, and show 
that your results are consistent with the thermodynamic relationship 


C v =VT 



3 2 m\ 

3 r 2 1 ' 

/ V 
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11.5. (a) Complete the mathematical steps leading to equations (11.3.15) and (11.3.16). 

(b) Complete the mathematical steps leading to equations (11.3.23) and (11.3.24). 

11.6. The ground-state pressure of an interacting Bose gas (see Lee and Yang, 1960a) turns out to be 

64 ^l ,2 m l / 2 a 
0 8jrah 2 157r ft 


where /m is the ground-state chemical potential of the gas. It follows that 


and 


fdP 0 \ 

\dp. 0 J 


fj. 0 m 

4rtah 2 


16 nl ,2 mh 2 a 
3tt ft 


Eo _ , n , Mo m 

y = (n M0 o )=^2 
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32 /ig /2 OT 1/2 a 
57 T ft 


Eliminating pa from these results, derive equations (11.3.16) and (11.3.17). 

11.7. Show that in an interacting Bose gas the mean occupation number ii p of the real particles and the 
mean occupation number N p of the quasiparticles are connected by the relationship 


N„ 


-<* Z P (N P - 


1) 


1 -at, 


(p^O), 


where a p is given by equations (11.3.9) and (11.3.10). Note that equation (11.3.22) corresponds to 
the special case N p = 0. 

11.8. The excitation energy of liquid He 4 , carrying a single excitation above the ground state, is 
determined by the minimum value of the quantity 


e = 




J2^f + V~Eo 



y*qid 3N r, 


where E 0 denotes the ground-state energy of the liquid while 4q according to Feynman, is given 
by equation (11.4.3). Show that the process of minimization of this expression leads to 
equation (11.4.5) for the energy of the excitation. 

[Hint: First express e in the form 

e= jLf l v / (r) l 2d3r ///*(riJ/(r2)g(rz-ri)d 3 nd 3 f5 ! . 

Then show that s is minimum when/(r) is of the form (11.4.4).] 

11.9. Show that, for a sufficiently large momentum Me (in fact, such that the slope ds/die of the energy 
spectrum is greater than the initial slope he), a state of double excitation in liquid He 4 is 
energetically more favorable than a state of single excitation, that is, there exist wavevectors 
fci and fc 2 such that, while k\ + k 2 = k, e(fci) + e(k 2 ) < e(fc). 

11.10. Using Fetter’s analytical approximation, 


fi(p') 


P' 

7(1 +p' 2 )’ 
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for the solution of equation (11.5.23) with 5=1, calculate the energy (per unit length) associated 
with a quantized vortex line of unit circulation. Compare your result with the one quoted in 
(11.5.26). 

11.11. (a) Study the nature of the velocity field arising from a pair of parallel vortex lines, with 5i =+l 

and 52 = — 1, separated by a distance d. Derive and discuss the general equation of the stream 
lines. 

(b) Again, using Fetter’s analytical approximation for the functions /(pj) and f(p' 2 ), calculate 
the energy (per unit length) of the system and show that its limiting value, as d -y 0, is 
1 l7r ft 2 «o/ 12m. Making use of this result, derive expression (11.6.8) for the critical velocity 
of superflow. 

11.12. Establish the asymptotic formula (11.7.30) for the function F(z 0 ). 

[Hint-. Write the coefficient that appears in the sum (11.7.24) in the form 


1 

v / (rsf)(r+5)(r+ 1 ) 



OO 

r-y i s-2?t-Kr+s)-W+t) dxdY dZd$dii. 

o 



Insert this expression into (11.7.24) and carry out summations over r, s, and t, with the result 


F(z 0 ) = 


; 3/2 


oo 

/ 


z 0 1 e x2 +%+ r i +1 z, 


1 e y2 +S + 1 z 0 l e z2+r i + 1 


dXdYdZd^ dq. 


In the limit z 0 -> oo, the integrand is essentially equal to 1 in the region R defined by 
X 2 + % + q < lnzo, Y 2 +f<lnzo, and Z 2 + p<lnzo; 


outside this region, it is essentially 0. Hence, the dominant term of the asymptotic expansion is 



R 


1 -dXdYdZd^dq, 


which, in turn, reduces to the double integral 


8 

tt 3 / 2 


JJ (lnz 0 - f - )?) 1/2 (lnzo - £) 1/2 (lnz 0 - q) l,2 d$ dry, 


the limits of integration here are such that not only f < (lnzo) and q < (lnzo), but also 
(§ + ??)< (lnzo). The rest of the calculation is straightforward.] 

11.13. The grand partition function of a gaseous system composed of mutually interacting, spin-half 
fermions has been evaluated by Lee and Yang (1957), with the result 27 


, PV V 

in a = —— = — 
kT A 3 

2/5/2 (Z) ~ — i/3/2 (-Z)j 2 


+ / ^rfl/2(Z){f 3 /2(z)) 2 - ^-F(Z) + ■ ■ ■ 


27 For the details of this calculation, see Lee and Yang (1959b) where the case of bosons, as well as of fermions, with 
spin / has been treated using the binary collision method. The second-order result for the case of spinless bosons was 
first obtained by Huang, Yang, and Luttinger (1957) using the method of pseudopotentials. 
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where z is the fugacity of the actual system (not of the corresponding noninteracting system, 
which was denoted by the symbol z 0 in the text); the functions /„(z) and F(z) are defined in a 
manner similar to equations (11.7.10) and (11.7.24). From this result, one can derive expressions 
for the quantities E(z, V, T) and N(z, V, T) by using the formulae 


E{z, V, T) = kT 2 9(lnQ) and N(z, y, T) 
3 T 


3(lnQ) 

3(lnz) 


2V" 

~w 


f3/2(Zo) 


(a) Eliminating z between these two results, derive equation (11.7.25) for E. 

(b) Obtain the zero-point value of the chemical potential //., correct to second order in (a/ a), and 
verify, with the help of equations (11.7.31) and (11.7.32), that 


(E + PV)t =o = 7V(M)r=o- 


[Hint: At T = 0 K, [i = (8E/8N) v .} 

(c) Show that the low-temperature specific heat and the low-temperature entropy of this gas are 
given by (see Pathria and Kawatra, 1962) 


Cv _ _S_ 
Nk ~ Nk 




8 

15;r 2 


(71n2 - l)(fcffl) 2 -I- 


where kp = (3n 2 n) 1 l 3 . Clearly, the factor within square brackets is to be identified with the 
ratio m*/m; see equations (11.8.12) and (11.8.26). 

[Hint: To determine Cv to the first power in T, we must know E to the second power in T. 
For this, we require higher-order terms of the asymptotic expansions of the functions/„ (z) 
and F(z): these are given by 

8 Jr 3 / 2 

/ 5 / 2 CZ) = (lnz) 5 / 2 + ——(biz) 1 / 2 + 0(1), 

15 3 
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2(2 ln2 D ^l/ 2 ^ 3/2 + 0(lnz) 5 / 4 _ 


The first three results here follow from the Sommerfeld lemma (E.17); for the last one, see 
Yang (1962).] 
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11.14. The energy spectrum e(p) of a gas composed of mutually interacting, spin-half fermions is given 
by (Galitskii, 1958; Mohling, 1961) 


e(p) ? 4 , 4,, 

[i2 /9m ^x 2 + — (kFa) + ] 5t 2 (k F a) 2 
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where x = p/pf < and k = p/ft. Show that, for k close to fcf, this spectrum reduces to 
e(p) 
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Using equations (11.8.10) and (11.8.11), check that this expression leads to the result 

yyi* g 

— -1 + t ?ln2 - D(fcffl) 2 . 
m 15 tt 2 

11.15. In the ground state of a Fermi system, the chemical potential is identical to the Fermi energy: 
(p)r=o = £(Pf)- Making use of the energy spectrum e(p) of the previous problem, we obtain 


Pf r 

(p)r=o - ^ 
2m 


1 + ^-(k F a) + 7^(11 - 21n2)(fc F fl) 2 
3 7T 157T Z 


Integrating this result, rederive equation (11.7.31) for the ground-state energy of the system. 

11.16. The energy levels of an imperfect Fermi gas in the presence of an external magnetic field B, to 
first order in a, may be written as 


f?« = I>+ + ViL 

p 


47 rah 2 


2m mV 


N + N~ — p*B(N + — N~)\ 


see equations (8.2.8) and (11.7.12). Using this expression fori?,, and following the procedure 
adopted in Section 8.2.A, study the magnetic behavior of this gas — in particular, the zero-field 
susceptibility y (T). Also examine the possibility of spontaneous magnetization arising from the 
interaction term with a > 0. 

11.17. Rewrite the Gross-Pitaevskii equation and the mean field energy, see equations (11.2.21) and 

(11.2.23), for an isotropic harmonic oscillator trap with frequency cog in a dimensionless form by 
defining a dimensionless wavefunction ft = flosc/ VWV, a dimensionless length s = r/a os c , and a 
dimensionless energy E/Nhcog. Show that the dimensionless parameter that controls the mean 
field energy is Na/a 0sc , where N is the number of particles in the condensate, a is the scattering 
length, and a osc = *Jh/ma>o- Next, show that the dimensionless versions of the Gross-Pitaevskii 
equation and the mean field energy are 


1 ~, 1 , 47T Na 9 

— V 2 i/ f H— s 2 "'!' 4- \fr\ ft = pff , 

2 2 ^osc 
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and 


Elf] 

NhcDQ 




2nNa 

a osc 



ds. 


11.18. Solve the Gross-Pitaevskii equation and evaluate the mean field energy, see equations (11.2.21) 
and (11.2.23), for a uniform Bose gas to show that this method yields precisely equation (11.2.6). 

11.19. Solve the Gross-Pitaevskii equation (11.2.23) in a harmonic trap for the case when the scattering 
length a is zero. Show that this reproduces the properties of the ground state of the noninteracting 
Bose gas. 

11.20. Solve the Gross-Pitaevskii equation and evaluate the mean field energy, see equations (11.2.21) 
and (11.2.23), for an isotropic harmonic oscillator trap with frequency a>o for the case Na/a OS c » 1 
by ignoring the kinetic energy term. Reproduce the results (11.2.25) through (11.2.28). 





Phase Transitions: Criticality, 
Universality, and Scaling 


Various physical phenomena to which the formalism of statistical mechanics has been 
applied may, in general, be divided into two categories. In the first category, the micro¬ 
scopic constituents of the given system are, or can be regarded as, practically noninteract¬ 
ing; as a result, the thermodynamic functions of the system follow straightforwardly from 
a knowledge of the energy levels of the individual constituents. Notable examples of phe¬ 
nomena belonging to this category are the specific heats of gases (Sections 1.4 and 6.5), 
the specific heats of solids (Section 7.4), chemical reactions and equilibrium constants 
(Section 6.6), the condensation of an ideal Bose gas (Sections 7.1 and 7.2), the spectral 
distribution of the blackbody radiation (Section 7.3), the elementary electron theory of 
metals (Section 8.3), the phenomenon of paramagnetism (Sections 3.9 and 8.2), and so on. 
In the case of solids, the interatomic interaction does, in fact, play an important physical 
role; however, since the actual positions of the atoms, over a substantial range of temper¬ 
atures, do not depart significantly from their mean values, we can rewrite our problem 
in terms of the so-called normal coordinates and treat the given solid as an “assembly of 
practically noninteracting harmonic oscillators.” We note that the most significant feature 
of the phenomena falling in the first category is that, with the sole exception of Bose- 
Einstein condensation, the thermodynamic functions of the systems involved are smooth 
and continuous! 

Phenomena belonging to the second category, however, present a very different sit¬ 
uation. In most cases, one encounters analytic discontinuities or singularities in the 
thermodynamic functions of the given system which, in turn, correspond to the occur¬ 
rence of various kinds of phase transitions. Notable examples of phenomena belonging 
to this category are the condensation of gases, the melting of solids, phenomena associ¬ 
ated with the coexistence of phases (especially in the neighborhood of a critical point), the 
behavior of mixtures and solutions (including the onset of phase separation), phenom¬ 
ena of ferromagnetism and antiferromagnetism, the order-disorder transitions in alloys, 
the superfluid transition from liquid He I to liquid He II, the transition from a normal to a 
superconducting material, and so on. The characteristic feature of the interparticle inter¬ 
actions in these systems is that they cannot be “removed” by means of a transformation 
of the coordinates of the problem; accordingly, the energy levels of the total system can¬ 
not, in any simple manner, be related to the energy levels of the individual constituents. 
One finds instead that, under favorable circumstances, a large number of microscopic 
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constituents of the system may exhibit a tendency of interacting with one another in a 
rather strong, cooperative fashion. This cooperative behavior assumes macroscopic signifi¬ 
cance at a particular temperature T c> known as the critical temperature of the system, and 
gives rise to the kind of phenomena listed previously. 

Mathematical problems associated with the study of cooperative phenomena are quite 
formidable. 1 To facilitate calculations, one is forced to introduce models in which the 
interparticle interactions are considerably simplified, yet retain characteristics that are 
essential to the cooperative aspect of the problem. One then hopes that a theoretical study 
of these simplified models, which still involves serious difficulties of analysis, will repro¬ 
duce the most basic features of the phenomena exhibited by actual physical systems. For 
instance, in the case of a magnetic transition, one may consider a lattice structure in which 
all interactions other than the ones among nearest-neighbor spins are neglected, ft turns 
out that a model as simplified as that captures practically all the essential features of the 
phenomenon — especially in the close neighborhood of the critical point. The inclusion 
of interactions among spins farther out than the nearest neighbors does not change these 
features in any significant manner, nor are they affected by the replacement of one lattice 
structure by another so long as the dimensionality of the lattice is the same. Not only this, 
these features may also be shared, with little modification, by many other physical sys¬ 
tems undergoing very different kinds of phase transitions, for example, gas-liquid instead 
of paramagnetic-ferromagnetic. This “unity in diversity” turns out to be a hallmark of 
the phenomena associated with phase transitions — a subject we propose to explore in 
considerable detail in this and the following two chapters, but first a few preliminaries. 


12.1 General remarks on the problem 
of condensation 

We consider an Ai-particle system, obeying classical or quantum statistics, with the proviso 
that the total potential energy of the system can be written as a sum of two-particle terms 
u(rij), with i < j. The function u(r) is supposed to satisfy the conditions 


u(r) 

= +00 

for 

r < a, 


0 > u(r) > —e 

for 

a <r < r* 

; (l) 

u(r) 

= 0 

for 

r>r* 



see Figure 12.1. Thus, each particle may be looked upon as a hard sphere of diameter a, 
surrounded by an attractive potential of range r* and of (maximum) depth s. From a prac¬ 
tical point of view, conditions (1) do not entail any “serious restriction” on the two-body 
potential, for the interparticle potentials ordinarily met with in nature are not materially 

fin this connection, one should note that the mathematical schemes developed in Chapters 10 and 11 give reliable 
results only if the interactions among the microscopic constituents of the given system are sufficiently weak — in fact, 
too weak to bring about cooperative transitions. 
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FIGURE 12.1 A sketch of the interparticle potential u(r), as given by equation (1). 


different from the one satisfying these conditions. We, therefore, expect that the conclu¬ 
sions drawn from the use of this potential will not be very far from the realities of the actual 
physical phenomena. 

Suppose that we are able to evaluate the exact partition function, Qn(V , T), of the 
given system. This function will possess certain properties that have been recognized and 
accepted for quite some time, though a rigorous proof of these was first attempted by van 
Hove as late as in 1949. 2 These properties can be expressed as follows: 

(i) In the thermodynamic limit (i.e., when N and V —»• oo while the ratio N/V stays 
constant), the quantity N~ 1 In Q tends to be a function only of the specific volume 
v (= V/N) and the temperature T; this limiting form may be denoted by the symbol 
f(v, T) . It is natural to identify f(v, T ) with the intensive variable —A/NkT, where A is 
the Helmholtz free energy of the system. The thermodynamic pressure P is then 
given by 


P(v, T) = — 



( 2 ) 


which turns out to be a strictly nonnegative quantity. 

(ii) The function /(p, T ) is everywhere concave, so the slope ( dP/dv)r of the (P, p)-curve 
is never positive. While at high temperatures the slope is negative for all v, at lower 
temperatures there can exist a region (or regions) in which the slope is zero and, 
consequently, the system is infinitely compressible! The existence of such regions, in 
the (P, p)-diagram, corresponds to the coexistence of two or more phases of different 
density in the given system; in other words, it constitutes direct evidence of the onset 
of a phase transition in the system. In this connection it is important to note that, so 
long as one uses the exact partition function of the system, isotherms of the van der 
Waals type, which possess unphysical regions of positive slope, never appear. On the 


2 For historical details, see Griffiths (1972, p. 12). 
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FIGURE 12.2 An unphysical isotherm corrected with the help of the Maxwell construction; the horizontal line is 
such that the areas A and B are equal. The "corrected" isotherm corresponds to a phase transition, taking place at 
pressure P(T), with densities vf and 1 of the respective phases. 


other hand, if the partition function is evaluated under approximations, as we did in 
the derivation of the van der Waals equation of state in Section 10.3, isotherms with 
unphysical regions may indeed appear. In that case the isotherms in question have 
got to be “corrected,” by introducing a region of “flatness” (dP/dv — 0), with the help 
of the Maxwell construction of equal areas ; see Figure 12.2. 3 The real reason for the 
appearance of unphysical regions in the isotherms is that the approximate evalua¬ 
tions of the partition function introduce, almost invariably (though implicitly), the 
restraint of a uniform density throughout the system. This restraint eliminates the 
very possibility of the system passing through states in which there exist, side by 
side, two phases of different densities; in other words, the existence of a region of 
“flatness” in the ( P , v )-diagram is automatically ruled out. On the other hand, an 
exact evaluation of the partition function must allow for all possible configurations 
of the system, including the ones characterized by a simultaneous existence of two 
or more phases of different densities. Under suitable conditions (for instance, when 
the temperature is sufficiently low), such a configuration might turn out to be the 
equilibrium configuration of the system, with the result that the system shows up in 
a multiphase, rather than a single-phase, state. We should, in this context, mention 
that if in the evaluation of the partition function one introduces no other 
approximation except the assumption of a uniform density in the system, then the 
resulting isotherms, corrected with the help of the Maxwell construction, would be 
the exact isotherms of the problem. 

3 The physical basis of the Maxwell construction can be seen with the help of the Gibbs free energy density g(T,P). 
Since dg = — sdT + vdP and along the “corrected” isotherm dP = dT = 0, it follows thatgi = g 3 ; see Figure 12.2. To achieve 
the same result from the theoretical isotherm (along which dT = 0 but dP^O), we clearly require that the quantity vdP, 
integrated along the isotherm from state 1 to state 3, must vanish; this leads to the “theorem of equal areas.” 
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(iii) The presence of an absolutely flat portion in an isotherm, with mathematical 

singularities at its ends, is, strictly speaking, a consequence of the limiting process 
N -»• oo. If N were finite, and if the exact partition function were used, then the 
quantity P', defined by the relation 


P = kT 


d lnQ\ 

dV ) N,T 


(3) 


would be free from mathematical singularities. The ordinarily sharp corners in an 
isotherm would be rounded off; at the same time, the ordinarily flat portion of the 
isotherm would not be strictly flat — it would have for large N a small, negative 
slope. In fact, the quantity P' in this case would not be a function of v and T alone; it 
would depend on the number N as well, though in a thermodynamically negligible 
manner. 

If we employ the grand partition function GL, as obtained from the exact partition 
functions Q/v, namely 


GL (z,V,D=Y,Qn( V ’D zN ’ (4) 

N>0 

a similar picture results. To see this, we note that for real molecules, with a given V, the 
variable N will be bounded by an upper limit, say N m , which is the number of molecules 
that fill the volume V “tight-packed”; obviously, N m ~ V/er 3 . For N > N m , the potential 
energy of the system will be infinite; accordingly, 


Qn(N > N m ) = 0. (5) 

Hence, for all practical purposes, our power series in (4) is a polynomial in z (which is > 0) 
and is of degree N m . Since the coefficients Qn are all positive and Q 0 = 1, the sum GL > 1. 
The thermodynamic potential In GL is, therefore, a well-behaved function of the parameters 
z, V, and T. Consequently, so long as V (and hence N m ) remains finite, we do not expect 
any singularities or discontinuities in any of the functions derived from this potential. A 
nonanalytic behavior could appear only in the limit ( V, N m ) oo. 

We now define P' by the relation 

kT 

P' =— InQ (V finite); (6) 

since GL > l,P' > 0. The mean number of particles and the mean square deviation in this 
number are given by the formulae 


N = 


d ln<2\ 


3 In z 


V,T 


(7) 
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and 


\ / V,T 

respectively; see Sections 4.2 and 4.5. Accordingly, 

/9ln<2\ _ / 9 In 0 \ / / dN \ _ N 

\ 9 N ) v,t \91nz/y T / y91nzy^^ /y2_]y 2 


At the same time, writing v for V/N and using (6), we have 

/9ln&\ V /dP'\ _ v 2 /dP'\ 

V dN )v,t kT \dN) v,t hT\dv)y j 

Comparing (9) and (10), we obtain 

/ dP' \ _ kT N 3 

\dv)vj V 2 W_N 2 ’ ^ ^ 

which is clearly nonpositive. 4 For finite V , expression (11) will never vanish; accordingly, 
P' will never be strictly constant. Nevertheless, the slope (dP'/dv) can, in a certain region, 
be extremely small — in fact, as small as 0(1 /N); such a region would hardly be distin¬ 
guishable from a phase transition because, on a macroscopic scale, the value of P' in such 
a region would be as good as a constant. 5 

If we now define the pressure of the system by the limiting relationship 


P( v, T) = Lim P' ( v , T; V) = kT Lim I — In Q.(z, V, T) 

V—>00 V—KX) V V 


( 12 ) 


then we can expect, in a set of isotherms, an absolutely flat portion (dP/dv = 0), with 
sharp corners implying mathematical singularities. The mean particle density h would 
now be given by 


n = Lim 

V—>oo 


" 1 9 lnfihz, V,T)~ 
V d lnz 


(13) 


it seems important to mention here that the operation V ->• oo and the operation 9/9 In z 
cannot be interchanged freely. 

In passing, we note that the picture emerging from the grand partition function GL, 
which has been obtained from the exact partition functions Qn, remains practically 


4 Compare equation (11), which has been derived here nonthermodynamically, with equation (4.5.7) derived earlier. 

5 The presence of such a region entails that (IV 2 — N 2 ) be 0(1V 2 ). This implies that the fluctuations in the variable N 
be macroscopically large, which in turn implies equally large fluctuations in the variable v within the system and hence 
the coexistence of two or more phases with different values of V. In a single-phase state, ( N 2 — N ) is O(N); the slope 
OP'/dv) is then 0(N ), as an intensive quantity should be. 
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unchanged even if one had employed a set of approximate Qn. This is so because the 
argument developed in the preceding paragraphs makes no use whatsoever of the actual 
form of the functions Qn■ Thus, if an approximate Qn leads to the van der Waals type 
of loop in the canonical ensemble, as shown in Figure 12.2, the corresponding set of Qn, 
when employed in a grand canonical ensemble, would lead to isotherms free from such 
loops (Hill, 1953). 

Subsequent to van Hove, Yang and Lee (1952) suggested an alternative approach that 
enables one to carry out a rigorous mathematical discussion of the phenomenon of con¬ 
densation and of other similar transitions. In their approach, one is primarily concerned 
with the analytic behavior of the quantities P and n, of equations (12) and (13), as func¬ 
tions of z at different values of T. The problem is examined in terms of the “zeros of the 
grand partition function GL in the complex z-plane,” with attention focused on the way 
these zeros are distributed in the plane and the manner in which they evolve as the vol¬ 
ume of the system is increased. For real, positive z, GL > 1, therefore none of the zeros will 
lie on the real, positive axis in the z-plane. However, as V -»■ oo (and hence the degree of 
the polynomial (4) and, with it, the number of zeros itself grows to infinity), the distribu¬ 
tion of zeros is expected to become continuous and, depending on T, may in fact converge 
on the real, positive axis at one or more points z c . If so, our functions P(z) and n(z), even 
with z varied along the real axis only, may, by virtue of their relationship to the function 
In GL, turn out to be singular at the points z — z c . The presence of such a singularity would 
imply the onset of a phase transition in the system. For further details of this approach, see 
Sections 12.3 and 12.4 of the first edition of this book; see also Griffiths (1972, pp. 50-58). 


12.2 Condensation of a van der Waals gas 

We start with the simplest, and historically the first, theoretical model that undergoes a 
gas-liquid phase transition. This model is generally referred to as the van der Waals gas 
and obeys the equation of state, see equation (10.3.9), 


v being the molar volume of the gas; the parameters a and b then also pertain to one 
mole of the gas. We recall that, while a is a measure of the attractive forces among the 
molecules of the system, b is a measure of the repulsive forces that come into play when 
two molecules come too close to one another; accordingly, b is also a measure of the 
“effective space” occupied by the molecules (by virtue of a finite volume that maybe asso¬ 
ciated with each one of them). In Section 10.3, the equation of state (1) was derived under 
the express assumption that v^>b\ here, we shall pretend, with van der Waals, that this 
equation holds even when v is comparable to b. 

The isotherms following from equation (1) are shown in Figure 12.3. We note that, 
for temperatures above a critical temperature T C ,P decreases monotonically with v. For 
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FIGURE 12.3 The isotherms of a van der Waals system; those for T <T C are "corrected" with the help of the 
Maxwell construction, thus leading to the coexistence curve at the top of which sits the critical point. 


T < T c , however, the relationship is nonmonotonic, so that over a certain range of v we 
encounter a region where (dP/dv) > 0; such a region is unphysical and must be “corrected” 
with the help of the Maxwell construction , 6 leading to an isotherm with a flat portion sig¬ 
naling transition from the gaseous state with molar volume v g (T) to the liquid state with 
molar volume V/(T) at a constant pressure P(T). For 17 < v < v g , the system resides in a 
state of mixed phases — partly liquid, partly gaseous — and, since the passage from one 
end of the flat portion to the other takes place with A.v ± 0 but A P = 0, the system is all 
along in a state of infinite compressibility; clearly, we are encountering here a brand of 
behavior that is patently singular. As T increases toward T c , the transition takes place at 
a comparatively higher value of P, with v g less than and V[ more than before — so that, 
as T -+ T c> both v g and Vi approach a common value v c that may be referred to as the 
critical volume ; the corresponding value of P may then be designated by P c , and we find 
ourselves located at the critical point of the system. The locus of all such points as vi(T) 
and Vg(T) is generally referred to as the coexistence curve, for the simple reason that in the 
region enclosed by this curve the gaseous and the liquid phases mutually coexist; the top 
of this curve, where Vi — v g , coincides with the critical point itself. Finally, the isotherm 


6 A more precise formulation of the van der Waals theory, as the limit of a theory with an infinite range potential, has 
been formulated by Kac, Uhlenbeck and Hemmer (1963). They considered the potential 


lUr) = 


l+oo 
{ -Ke~ Kr 


for 

for 


r<a 
r > a, 


so that the integral /“ u(r)dr is simply — exp(—*rcr); when k -> 0 the potential becomes infinite in range but infinitesi¬ 
mally weak. Kac et at showed that, in this limit, the model becomes essentially the same as van der Waals’ — with one 
noteworthy improvement, that is, no unphysical regions in the (P, y)-diagram appear and hence no need for the Maxwell 
construction arises. 
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pertaining to T — T c , which, of course, passes through the critical point is referred to as the 
critical isotherm of the system; it is straightforward to see that the critical point is a point 
of inflection of this isotherm, so that both ( 9P/dv)r and (d 2 P/dv 2 )r vanish at this point. 
Using (1), we obtain for the coordinates of the critical point 


Pr = 


a , 8 a 

v c ~ 3 b, T c = 


27 b 2 ’ 


27 bR' 


( 2 ) 


so that the number 


K. = RTc/PcVc = 8/3 = 2.666 _ ( 3 ) 

We thus find that, while P c , v c , and T c vary from system to system (through the interaction 
parameters a and b ), the quantity 1C has a common, universal value for all of them — so 
long as they all obey the same (i.e., van der Waals) equation of state. The experimental 
results for /C indeed show that it is nearly the same over a large group of substances; for 
instance, its value for carbon tetrachloride, ethyl ether, and ethyl formate is 3.677, 3.814, 
and 3.895, respectively — close, though not exactly the same, and also a long way from the 
van der Waals value. The concept of universality is, nonetheless, there (even though the 
van der Waals equation of state may not truly apply). 

It is now tempting to see if the equation of state itself can be written in a universal form. 
We find that this indeed can be done by introducing reduced variables 



Using (1) and (2), we readily obtain the reduced equation of state 

yPr + (3c r — 1) = 8T r , (5) 

which is clearly universal for all systems obeying van der Waals’ original equation of 
state (1); all we have done here is to rescale the observable quantities P, v, and T in terms 
of their critical values and thereby “push the interaction parameters a and b into the back¬ 
ground.’’ Now, if two different systems happen to be in states characterized by the same 
values of v r and T r , then their P r would also be the same; the systems are then said to be in 
“corresponding states” and, for that reason, the statement just made is referred to as the 
“law of corresponding states.” Clearly, the passage from equation (1) to equation (5) takes 
us from an expression of diversity to a statement of unity! 

We shall now examine the behavior of the van der Waals system in the close neighbor¬ 
hood of the critical point. For this, we write 


P r = 1 + n, v r = 1 + i/o T r = 1 + t. 


( 6 ) 
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Equation (5) then takes the form 

n (l + 7f + 81 /f 2 + 3iA 3 ) + 3f 3 = 8t (l + 2f + f 2 ^. (7) 

First of all, along the critical isotherm (f = 0) and in the close vicinity of the critical point 
(|tt|, \if | < T), we obtain the simple, asymptotic result 

3 o 

( 8 ) 

which is indicative of the “degree of flatness” of the critical isotherm at the critical point. 
Next, we examine the dependence of on t as we approach the critical point from below. 
For this, we write (7) in the form 

3f 3 + 8 (jr - t) xfr 2 + (7jt - 16f) f + 2 (n - At) ~ 0. (9) 

Now, a close look at the (symmetric) shape of the coexistence curve near its top (where 
\t\ <?C 1 ) shows that the three roots t/q , 1 // 2 , and 1//3 of equation (9), which arise from the 
limiting behavior of the roots iq, v 2 , and v :i of the original equation of state (1) as T -> T c -, 
are such that | ^2 1 <«C I f 1,3 1 and 1/0 1 — hA.al- This means that, in the region of interest, 

n^41, ( 10 ) 

so that one of the roots, t/t 2 > of equation (9) essentially vanishes while the other two are 
given by 


i/f 2 + 8ti/f + 4t ~ 0. (9a) 

We expect the middle term here to be negligible (as will be confirmed by the end result), 
yielding 


Vn,3^±2|r| 1/2 ; 


(it) 


note that the upper sign here pertains to the gaseous phase and the lower sign to the liquid 
phase. 

Finally, we consider the isothermal compressibility of the system which, in terms of 
reduced variables, is determined essentially by the quantity —(d\lr/d7t) t . Retaining only the 
dominant terms, we obtain from (7) 


-m „_^_ (12) 

\dn ) t 7?r + 9^f 2 — 16f 

For t > 0, we approach the critical point along the critical isochore = 0); equation (12), 
with the help of equation (10), then gives 


1 

6 ? 


df 

dn 


f->- 0 + 


( 13 ) 
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For t < 0, we approach the critical point along the coexistence curve (on which i// 2 ~ —4 1); 
we now obtain 


d±\ ~ 1 

9jr/r->0- 12 |f| 


(14) 


For the record, we quote here results for the specific heat, Cy, of the van der Waals gas 
(Uhlenbeck, 1966; Thompson, 1988) 

£ ^ (Cv)ideal + 2 ^" ^ + 25 ^ d <Q) (15a) 

. (CV)ideal (f>0), (15b) 

which imply a finite jump at the critical point. 

Equations (8), (11), (13), (14), and (15) illustrate the nature of the critical behavior dis¬ 
played by a van der Waals system undergoing the gas-liquid transition. While it differs in 
several important respects from the critical behavior of real physical systems, it shows up 
again and again in studies pertaining to other critical phenomena that have apparently 
nothing to do with the gas-liquid phase transition. In fact, this particular brand of behav¬ 
ior turns out to be a benchmark against which the results of more sophisticated theories 
are automatically compared. 


12.3 A dynamical model of phase transitions 

A number of physico-chemical systems that undergo phase transitions can be represented, 
to varying degrees of accuracy, by an “array of lattice sites, with only nearest-neighbor 
interaction that depends on the manner of occupation of the neighboring sites.” This 
simple-minded model turns out to be good enough to provide a unified, theoretical basis 
for understanding a variety of phenomena such as ferromagnetism and antiferromag¬ 
netism, gas-liquid and liquid-solid transitions, order-disorder transitions in alloys, phase 
separation in binary solutions, and so on. There is no doubt that this model considerably 
oversimplifies the actual physical systems it is supposed to represent; nevertheless, it does 
retain the essential physical features of the problem — features that account for the prop¬ 
agation of long-range order in the system. Accordingly, it does lead to the onset of a phase 
transition in the given system, which arises in the nature of a cooperative phenomenon. 

We find it convenient to formulate our problem in the language of ferromagnetism; 
later on, we shall establish correspondence between this language and the languages 
appropriate to other physical phenomena. We thus regard each of the N lattice sites to 
be occupied by an atom possessing a magnetic moment /t, of magnitude gPBy/UU + 1)], 
which is capable of (2/+ 1) discrete orientations in space. These orientations define “dif¬ 
ferent possible manners of occupation” of a given lattice site; accordingly, the whole 
lattice is capable of (2/ + 1) N different configurations. Associated with each configuration 
is an energy E that arises from mutual interactions among the neighboring atoms of the 
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lattice and from the interaction of the whole lattice with an external field B. A statistical 
analysis in the canonical ensemble should then enable us to determine the expectation 
value, M(B, T), of the net magnetization M. The presence of a spontaneous magnetization 
M{ 0, T) at temperatures below a certain (critical) temperature T c and its absence above 
that temperature will then be interpreted as a ferromagnetic phase transition in the system 
at T = T c . 

Detailed studies, both theoretical and experimental, have shown that, for all ferromag¬ 
netic materials, data on the temperature dependence of the spontaneous magnetization, 
M(0, T), fit best with the value J =\\ see Figure 12.4. One is, therefore, tempted to infer 
that the phenomenon of ferromagnetism is associated only with the spins of the electrons 
and not with their orbital motions. This is further confirmed by gyromagnetic experiments 
(Barnett, 1944; Scott, 1951,1952), in which one either reverses the magnetization of a freely 
suspended specimen and observes the resulting rotation or imparts a rotation to the spec¬ 
imen and observes the resulting magnetization; the former is known as the Einstein-de 
Haas method, the latter the Barnett method. From these experiments one can derive the 
relevant g-value of the specimen which, in each case, turns out to be very close to 2; this, 
as we know, pertains to the electron spin. Therefore, in discussing the problem of ferro¬ 
magnetism, we may specifically take: p — 2jip > J\s(s+ 1 )], where .s' is the quantum number 
associated with the electron spin. With s = |, only two orientations are possible for each 
lattice site, namely s z = +\ (with p z = +ps) and s z = (with p z = —p B ). The whole lattice 
is then capable of 2 N configurations; one such configuration is shown in Figure 12.5. 

We now consider the nature of the interaction energy between two neighboring spins 
Si and sj. According to quantum mechanics, this energy is of the form Kp ±/y, where the 
upper sign applies to “antiparallel” spins (S = 0) and the lower sign to “parallel” spins 
(S = 1). Here, Ky is the direct or Coulomb energy between the two spins, while /y is the 



FIGURE 12.4 Spontaneous magnetization of iron, nickel, and cobalt as a function of temperature. Theoretical 
curves are based on the Weiss theory of ferromagnetism. 
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FIGURE 12.5 One of the 2 N possible configurations of a system composed of N spins; here, N = 25. 


exchange energy between them: 

Ktj = f f* (1) V'/ (2) My ^7 (2) Vr/(D dr, dr 2 , (1) 

while 

Jij = f f*(\)f*( 2 )u ij i, j ( 2 )f i (l)dz\dT 2l (2) 

Uij being the relevant interaction potential. The energy difference between a state of 
“parallel” spins and one of “antiparallel” spins is given by 


e tt — e tt — — 2/y. 


(3) 


If Jij > 0, the state is energetically favored against the state ; we then look for the 
possibility of ferromagnetism. If, on the other hand, Jij < 0, the situation is reversed and we 
see the possibility of antiferromagnetism. 

It seems useful to express the interaction energy of the two states, ft and 1by a single 
expression; for this, we consider the eigenvalues of the scalar product 



= ^S(S+1)-5(5+1), 


(4) 


which equals + f if S — 1 and -1 if S = 0. We may, therefore, write for the interaction energy 
of the spins i and j 


Sij = const. - 2 Jij(Si ■ sj), (5) 

which is consistent with the energy difference (3) . The precise value of the constant here is 
immaterial because the potential energy is arbitrary to the extent of an additive constant 
anyway. Typically, the exchange interaction f j falls off rapidly as the separation of the two 
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spins is increased. To a first approximation, therefore, we may regard Jy as negligible for all 
but nearest-neighbor pairs (for which its value may be denoted by a common symbol /). 
The interaction energy of the whole lattice is then given by 

E = const. — 2J^2( s i' s j)’ i 6 ) 

n.n. 

where the summation goes over all nearest-neighbor pairs in the lattice. The model based 
on expression (6) for the interaction energy of the lattice is known as the Heisenberg model 
(1928). 

A simpler model results if we use, instead of (6), a truncated expression in which the 
product (Si ■ sj), which is equal to the sum (Si x s jx + Si y Sj y + Si Z Sj Z ), is replaced by a single term 
Si z Sj z , one reason for adopting this simpler model is that it does not necessarily require a 
quantum-mechanical treatment (because all the variables in the truncated expression for 
E commute). Expression (6) may now be written as 

E = const. — / ^2 a i a i< (7) 

n.n. 

where the new symbol cr* (or oj) = +1 for an “up” spin and —1 for a “down” spin; note 
that, with the introduction of the new symbol, we still have: = —2/. The model 

based on expression (7) is known as the Ising model ; it originated with Lenz (1920) and 
was subsequently investigated by his student Ising (1925) . 7 

A different model results if we suppress the z-components of the spins and retain the 
x- and y-components instead. This model was originally introduced by Matsubara and 
Matsuda (1956) as a model of a quantum lattice gas, with possible relevance to the super¬ 
fluid transition in liquid He 4 . The critical behavior of this so-called XY model has been 
investigated in detail by Betts and coworkers, who have also emphasized the relevance of 
this model to the study of insulating ferromagnets (see Betts et al., 1968 - 1974). 

It seems appropriate to regard the Ising and the XY models as special cases of an 
anisotropic Heisenberg model with interaction parameters J x ,Jy, and J z ; while the Ising 
model represents the situation J x ,J y « J z , the XY model represents just the opposite. Intro¬ 
ducing a parameter n, which denotes the number of spin components entering into the 
Hamiltonian of the system, we may regard the Ising, the XY, and the Heisenberg models 
as pertaining to the n-values 1, 2, and 3, respectively. As will be seen later, the parameter 
n, along with the dimensionality d of the lattice, constitutes the basic set of elements that 
determine the qualitative nature of the critical behavior of a given system. For the time 
being, though, we confine our attention to the Ising model, which is not only the sim¬ 
plest one to analyze but also unifies the study of phase transitions in systems as diverse as 
ferromagnets, gas-liquids, liquid mixtures, binary alloys, and so on. 

7 For an historical account of the origin and development of the Lenz-Ising model, see the review article by Brush 
(1967). This review gives a large number of other references as well. 
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To study the statistical mechanics of the Ising model, we disregard the kinetic energy 
of the atoms occupying the various lattice sites, for the phenomenon of phase transitions 
is essentially a consequence of the interaction energy among the atoms; in the interac¬ 
tion energy again, we include only the nearest-neighbor contributions, in the hope that 
the farther-neighbor contributions would not affect the results qualitatively. To fix the 
^-direction, and to be able to study properties such as magnetic susceptibility, we subject 
the lattice to an external magnetic field B, directed “upward”; the spin a ; - then possesses 
an additional potential energy -pBoj. a The Hamiltonian of the system in configuration 
{eri, <T 2 , ■ ■ ■, ct]v} is then given by 


H{oi} = -jJ2 a i a j ~ 

n.n. i 


( 8 ) 


and the partition function by 


°1 °2 °N 

= I]I]'I] ex P PJ'Ewj + PuB^ai 

°1 °2 °N L n.n. i 


(9) 


The Helmholtz free energy, the internal energy, the specific heat, and the net magnetiza¬ 
tion of the system then follow from the formulae 


A(B, T) = -kT\nQ N (B, T ), (10) 


U(B,T) = -T 2 ^ f (^j=kT 2 ^ f \nQ N , (11) 



Obviously, the quantity M(0, T) gives the spontaneous magnetization of the system; if it 
is nonzero at temperatures below a certain critical temperature T c , the system would be 
ferromagnetic for T <T C and paramagnetic for T > T c . At the transition temperature itself, 
the system is expected to show some sort of a singular behavior. 

It is obvious that the energy levels of the system as a whole will be degenerate, in the 
sense that the various configurations {ad will not all possess distinct energy values. In 
fact, the energy of a given configuration does not depend on the detailed values of all 


'Henceforth, we use the symbol /r instead of /ig. 
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the variables of, it depends only on a few numbers such as the total number N+ of “up” 
spins, the total number N ++ of “up-up” nearest-neighbor pairs, and so on. To see this, 

we define certain other numbers as well: 1V_ as the total number of “down” spins, N _as 

the total number of “down-down” nearest-neighbor pairs, and A/+_ as the total number 
of nearest-neighbor pairs with opposite spins. The numbers N+ and AC must satisfy the 
relation 


AT + +AC=AT. (14) 

And if q denotes the coordination number of the lattice, that is, the number of nearest 
neighbors for each lattice site, 9 then we also have the relations 

qN + = 2N ++ +N+_, (15) 

qN_ = 2AC_+A/+_. (16) 

With the help of these relations, we can express all our numbers in terms of any two of 
them, say AC andA/+ + . Thus 

N_=N-N+, N + _=qN + -2N ++t N—= ~qN- qN++ N++; (17) 

it will be noted that the total number of nearest-neighbor pairs of all types is given, quite 
expectedly, by the expression 


N++ + N _+ A/+_ = -qN. (18) 

Naturally, the Hamiltonian of the system can also be expressed in terms of N+ and N ++ ; 
we have from (8), with the help of the relations established above, 


H n (N+,N ++ ) = -/(, N ++ + N — - A/+_) - tiB(N + - AC) 

= -/Qf JN - 2qN+ + 4AT++^ - qB(2N+ - AT). (19) 

Now, let g,v (AC, N ++ ) be “the number of distinct ways in which the N spins of the lattice 
can be so arranged as to yield certain preassigned values of the numbers N+ and N ++ .” The 
partition function of the system can then be written as 

Qn(B,T)= XI gN(N + ,N ++ )exp{-pH N (N + ,N ++ )}, (20) 

N+,N ++ 

9 The coordination number q for a linear chain is obviously 2; for two-dimensional lattices, namely honeycomb, 
square, and triangular, it is 3, 4, and 6, respectively; for three-dimensional lattices, namely simple cubic, body-centered 
cubic, and face-centered cubic, it is 6, 8, and 12, respectively. 
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that is, 


e-f> A = J2 e -2 m-^m + J2'gN(N + ,N ++ )e 4 f >IN ++ l (21) 

N + =0 N ++ 

where the primed summation in (21) goes over all values of N ++ that are consistent 
with a fixed value of N + and is followed by a summation over all possible values of N + , 
that is, from N + — 0 to N + — N. The central problem thus consists in determining the 
combinatorial function g ; v(/V_, /V_ + ) for the various lattices of interest. 


12.4 The lattice gas and the binary alloy 

Apart from ferromagnets, the Ising model can be readily adapted to simulate the behavior 
of certain other systems as well. More common among these are the lattice gas and the 
binary alloy. 

The lattice gas 

Although it had already been recognized that the results derived for the Ising model would 
apply equally well to a system of “occupied” and “unoccupied” lattice sites (i.e., to a system 
of “atoms” and “holes” in a lattice), it was Yang and Lee (1952) who first used the term 
“lattice gas” to describe such a system. By definition, a lattice gas is a collection of atoms, 
N a in number, that can occupy only discrete positions in space — positions that constitute 
a lattice structure with coordination number q. 

Each lattice site can be occupied by at most one atom, and the interaction energy 
between two occupied sites is nonzero, say —e 0 , only if the sites involved constitute a 
nearest-neighbor pair. The configurational energy of the gas is then given by 

E = —£q Naci) (1) 

where N aa is the total number of nearest-neighbor pairs (of occupied sites) in a given con¬ 
figuration of the system. Let gN(N a ,N a a) denote “the number of distinct ways in which 
the N a atoms of the gas, assumed indistinguishable, can be distributed among the N sites 
of the lattice so as to yield a certain preassigned value of the number N aa .” The partition 
function of the system, neglecting the kinetic energy of the atoms, is then given by 

QiVfl (N, D = Egv (Na, Naa) e^ Naa , (2) 

Naa 

where the primed summation goes over all values of N aa that are consistent with the given 
values of N a and At; clearly, the number N here plays the role of the “total volume” available 
to the gas. 
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Going over to the grand canonical ensemble, we write for the grand partition function 
of the system 


N 


mz.N, T)= £ z Na Q Na (N,T). 


Nn =0 


The pressure P and the mean number N a of the atoms in the gas are then given by 


e PPN _ Y^ a J2gN ( Na.Naa) e 1 

T _n rJnn 




N a =0 


(3) 


(4) 


and 


N a _ 

X _ z | 

7 3 P 

N = 

~ v kT ' 

k dz 


(5) 


here, v denotes the average volume per particle of the gas (measured in terms of the 
“volume of a primitive cell of the lattice”). 

To establish a formal correspondence between the lattice gas and the ferromagnet, we 
compare the present formulae with the ones established in the preceding section — in par¬ 
ticular, formula (4) with formula (12.3.21). The first thing to note here is that the canonical 
ensemble of the ferromagnet corresponds to the grand canonical ensemble of the lattice 
gas! The rest of the correspondence is summarized in the following chart: 


where 


The lattice gas 


N a ,N-N a 

e 0 

z 

P 



The ferromagnet 


N+,N-N + (=N _) 


4/ 


exp{—2j6 (qJ-fiB)} 



qj — fiB 


AT± 

N 


1 

2 



M = n(N+-N-) = /j.(2N+ - N). 

We also note that the ferromagnetic analogue of formula (5) would be 

N+ _ 1 | d(A/N+^qJ-fiB) 1 _ t r i / M \ 

~N ~ kJ | 2,63 (qJ-nB) J “ 2 \9S/r + 


( 6 ) 


( 7 ) 
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which, by equation (12.3.13), assumes the expected form 



( 8 ) 


It is quite natural to ask: does lattice gas correspond to any real physical system in 
nature? The immediate answer is that if we let the lattice constant tend to zero (thus going 
from a discrete structure to a continuous one) and also add, to the lattice-gas formulae, 
terms corresponding to an ideal gas (namely, the kinetic energy terms), then the model 
might simulate the behavior of a gas of real atoms interacting through a delta function 
potential. A study of the possibility of a phase transition in such a system may, therefore, 
be of some value in understanding phase transitions in real gases. The case so > 0, which 
implies an attractive interaction among the nearest neighbors, has been frequently cited 
as a possible model for a gas-liquid transition. 

On the other hand, if the interaction is repulsive (so < 0), so that configurations with 
alternating sites being “occupied” and “unoccupied” are the more favored ones, then we 
obtain a model that arouses interest in connection with the theory of solidification; in 
such a study, however, the lattice constant has to stay finite. Thus, several authors have 
pursued the study of the antiferromagnetic version of this model, hoping that this might 
throw some light on the liquid-solid transition. For a bibliography of these pursuits, see 
the review article by Brush (1967). 

The binary alloy 

Much of the early activity in the theoretical analysis of the Ising model was related to the 
study of order-disorder transitions in alloys. In an alloy — to be specific, a binary alloy — 
we have a lattice structure consisting of two types of atoms, say 1 and 2, numbering N\ 
and IV 2 , respectively. In a configuration characterized by the numbers Ain, N22, and N12 of 
the three types of nearest-neighbor pairs, the configurational energy of the alloy may be 
written as 


E = en TVi 1 + S22N22 + 612^12 


(9) 


where en, £ 22 . and £12 have obvious meanings. As in the case of a ferromagnet, the various 
numbers appearing in the expression for E may be expressed in terms of the numbers 
N,N\, and 1 (of which only Ain is variable here). Equation (9) then takes the form 



-^£22AT + g(£i2 -£22Wi + (£11 +£22 - 2fil2)Ain. 


(10) 
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The correspondence between this system and the lattice gas is now straightforward: 

The lattice gas The binary alloy 

N a ,N-N a Ni,N-Ni{=N 2 ) 

-eo (sii+£ 22 - 2 ei 2 ) 

A A — \qe22N — Cj(si 2 — S 22 )Nl 

The correspondence with a ferromagnet can be established likewise; in particular, this 
requires that en = £22 = -/ and £12 = +/. 

At absolute zero, our alloy will be in the state of minimum configurational energy, 
which would also be the state of maximum configurational order. We expect that the 
two types of atoms would then occupy mutually exclusive sites, so that one might 
speak of atoms 1 being only at sites a and atoms 2 being only at sites b. As tempera¬ 
ture rises, an exchange of sites results and, in the face of thermal agitation, the order 
in the system starts giving way Ultimately, the two types of atoms get so “mixed up” 
that the very notion of the sites a being the “right” ones for atoms 1 and the sites 
b being the “right” ones for atoms 2 break down; the system then behaves, from the 
configurational point of view, as an assembly of Ni + N 2 atoms of essentially the same 
species. 


12.5 Ising model in the zeroth approximation 

In 1928 Gorsky attempted a statistical study of order-disorder transitions in binary alloys 
on the basis of the assumption that the work expended in transferring an atom from 
an “ordered” position to a “disordered” one (or, in other words, from a “right” site to 
a “wrong” one) is directly proportional to the degree of order prevailing in the system. 
This idea was further developed by Bragg and Williams (1934, 1935) who, for the first 
time, introduced the concept of long-range order in the sense we understand it now 
and, with relatively simple mathematics, obtained results that could explain the main 
qualitative features of the relevant experimental data. The basic assumption in the Bragg- 
Williams approximation was that the energy of an individual atom in the given system 
is determined by the ( average ) degree of order prevailing in the entire system rather 
than by the ( fluctuating ) configurations of the neighboring atoms. In this sense, the 
approximation is equivalent to the mean molecular field (or the internal field) theory of 
Weiss, which was put forward in 1907 to explain the magnetic behavior of ferromagnetic 
materials. 

It seems natural to call this approximation the zeroth approximation, for its features 
are totally insensitive to the detailed structure, or even to the dimensionality, of the lattice. 
We expect that the results following from this approximation will become more reliable 
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as the number of neighbors interacting with a given atom increases (i.e., as q -»• oo), thus 
diminishing the importance of local, fluctuating influences. 10 

We now define a long-range order parameter I in a given configuration by the very 
suggestive relationship 



i 


(-1 < L <+ 1 ), 


( 1 ) 


which gives 

N N 

N+ — — (1 + L) and 1V_ = -(1-L). (2) 

The magnetization M is then given by 


M = (N+ — = NfiL (—N[i<M<+N[i); (3) 

the parameter L is, therefore, a direct measure of the net magnetization in the system. For 
a completely random configuration, N+ = A/_ = ^N; the expectation values of both L and 
M are then identically zero. 

Now, in the spirit of the present approximation, we replace the first part of the Hamilto¬ 
nian (12.3.8) by the expression JT 07 , that is, for a given < 77 , we replace each of the 

qaj by a while the factor \ is included to avoid duplication in the counting of the nearest- 
neighbor pairs. Making use of equation (1), and noting that a = L, we obtain for the total 
configurational energy of the system 

E = ~\ NL ~ (fj-B)NL. (4) 

The expectation value of E is then given by 

U=- X - qJNL 2 - 11 BNL. (5) 

In the same approximation, the difference Ae between the overall configurational energy 
of an “up” spin and the overall configurational energy of a “down" spin — specifically, 
the energy expended in changing an “up” spin into a “down” spin — is given by, see 
equation (12.3.8), 


As = —J(qo)Ao — fiBAa 

= 2 n^-W + Bj, (6) 

10 In connection with the present approximation, we may as well mention that early attempts to construct a theory of 
binary solutions were based on the assumption that the atoms in the solution mix randomly. One finds that the results 
following from this assumption of random mixing are mathematically equivalent to the ones following from the mean 
field approximation ; see Problems 12.12 and 12.13. 
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for here Act = —2. The quantity qjo/p thus plays the role of the internal (molecular ) field 
of Weiss; it is determined by (i) the mean value of the long-range order prevailing in the 
system and by (ii) the strength of the coupling, qj, between a given spin i and its q nearest 
neighbors. The relative values of the equilibrium numbers N+ and 1V_ then follow from 
the Boltzmann principle, namely 

N-/N+ = exp(— A e/kT) = exp{—2 fj.(B' + B)/kT\ , (7) 

where B' denotes the internal molecular held: 

B' = qjo/n = q] (m/AT/t 2 ) . (8) 

Substituting (2) into (7), and keeping in mind equation (8), we obtain for L 

-—£ = exp{-2 (qJL +iiB)/kT] (9) 

1 T L 

or, equivalently, 


qjL ~T llB 1 1-TT i — 

- In —^ = tanh _1 L. (10 

kT 2 l — L 

To investigate the possibility of spontaneous magnetization, we let B-> 0, which leads 
to the relationship 

Io=tanh ( q w)- (11) 

Equation (11) may be solved graphically; see Figure 12.6. For any temperature T, the 
appropriate value of Lq(T) is determined by the point of intersection of (i) the straight 
line y — Lo and (ii) the curve y — lanhfq/Lo /kT ). Clearly, the solution L 0 = 0 is always there; 
however, we are interested in nonzero solutions, if any. For those, we note that, since the 
slope of the curve (ii) varies from the initial value qj/kT to the final value zero while the 
slope of the line (i) is unity throughout, an intersection other than the one at the origin is 
possible if, and only if, 


qJ/kT > 1, 


( 12 ) 


that is, 


T < qj/k = T c , say. (13) 

We thus obtain a critical temperature T c , below which the system can acquire a nonzero 
spontaneous magnetization and above which it cannot. It is natural to identify T c with 
the Curie temperature of the system — the temperature that marks a transition from the 
ferromagnetic to the paramagnetic behavior of the system or vice versa. 

It is clear from Figure 12.6, as well as from equation (11), that if L 0 is a solution of the 
problem, then — L 0 is also a solution. The reason for this duplicity of solutions is that, in 
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tanh (L 0 T C /T),L 0 



FIGURE 12.6 The graphical solution of equation (11), with T c = qj/k. 


the absence of an external field, there is no way of assigning a “positive,” as opposed to a 
“negative,” direction to the alignment of spins. In fact, if B were zero right from the begin¬ 
ning, then the positive solution of equation (11) would be as likely to occur as the negative 
one—with the result that the true expectation value oiLo(T) would be zero. If, on the other 
hand, B were nonzero to begin with (to be definite, say B > 0), then equation (10) for L(B, T ) 
would admit only positive solutions and, in the limit B -*■ 0+, we would obtain a positive 
Lq(T). The “up-down symmetry” will then be broken and we will see a net alignment of 
spins in the “up” direction. 11 

The precise variation of Lq(T) with T can be obtained by solving equation (11) numeri¬ 
cally; the general trend, however, can be seen from Figure 12.6. We note that, at T — 
qj/k (— T c ), the straight line y — Lq is tangential to the curve y — Vdnh( qJLo/kT) at the ori¬ 
gin; the relevant solution then is Lq(T c ) = 0. As T decreases, the initial slope of the curve 
becomes larger and the relevant point of intersection moves rapidly away from the origin; 
accordingly, To(T) rises rapidly as T decreases below T c . To obtain an approximate depen¬ 
dence of Lo(T) on T near T = T c , we write (11) in the form Lq — tanh(Lo7c/T) and use the 
approximation tanhx ~ x - x 3 /S, to obtain 

Lo(T) * (3(1 - T/Tc)} 1 ' 2 (T < T C ,B —>■ 0). (14) 

On the other hand, as T -> 0,Lo -> 1, in accordance with the asymptotic relationship 

L 0 (T)^l-2exp(-2T c /T) {(T/T c )« 1}. (15) 

Figure 12.7 shows a plot of Lq(T) versus T, along with the relevant experimental results for 
iron, nickel, cobalt, and magnetite; we find the agreement not too bad. 

u The concept of “broken symmetry” plays a vital role in this and many other phenomena in physics; for details, see 
Fisher (1972) and Anderson (1984). 
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FIGURE 12.7 The spontaneous magnetization of a Weiss ferromagnet as a function of temperature. The 
experimental points (after Becker) are for iron (x), nickel (o), cobalt (A), and magnetite (+). 


The field-free configurational energy and the field-free specific heat of the system are 
given by, see equation (5), 


U 0 CO = -\qJNLl 


(16) 


and 


Co CO — —qJNLo 


cILq 

~dT 


NkL 2 0 

(T/T c ) 2 /(1-I 2 0 )-T/T c ’ 


(17) 


where the last step has been carried out with the help of equation (11). Thus, for all T > T c , 
both U()(T) and C'o(T) are identically zero. However, the value of the specific heat at the 
transition temperature T c , as approached from below, turns out to be, see equations (14) 
and (17), 


Co(Tc-) = Lim 
x-^0 


Nk ■ 3x 




= -Nk. 
2 


(18) 


The specific heat, therefore, displays a discontinuity at the transition point. On the 
other hand, as T -> 0, the specific heat vanishes, in accordance with the formula, see 
equations (15) and (17), 


CoCO: 


■ANk[ — 


exp(—2 T c /T). 


(19) 


The full trend of the function Co(T) is shown in Figure 12.8. 

ft is important to note that the vanishing of the configurational energy and the spe¬ 
cific heat of the system at temperatures above T c is directly related to the fact that, in the 
present approximation, the configurational order prevailing in the system at lower temper¬ 
atures is completely wiped out as T -»• T c . Consequently, the configurational entropy and 
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FIGURE 12.8 The field-free specific heat of a Weiss ferromagnet as a function of temperature. 


the configurational energy of the system attain their maximum values at T = T c \ beyond 
that, the system remains thermodynamically “inert.” As a check, we evaluate the con¬ 
figurational entropy of the system at T — T c ; with the help of equations (11) and (17), 
we get 


So (7c) — 


J C 0 (T)dT 
o 



= Nk J(tanh 1 Lq)cILq = M;ln2, (20) 

o 

precisely the result we expect for a system capable of 2 N equally likely microstates. 12 The 
fact that all these microstates are equally likely to occur is again related to the fact that for 
T >T C there is no (configurational) order in the system. 

We now proceed to study the magnetic susceptibility of the system. Using equa¬ 
tion (10), we get 


X(B,T) = 



Nfi 2 1 -L 2 (B,T) 
k T-T c {1-L 2 (B,T)} 


( 21 ) 


For L <$: 1 (which is true at high temperatures for a wide range of B but is also true near T c 
if B is small), we obtain the Curie-Weiss law 


Xo {T)^(Nti 2 /k)(T-T c )- 1 (T>T Ct B^ 0), 
12 Recall equation (3.3.14), whereby S = fclnQ. 


(22a) 
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which may be compared with the Curie law derived earlier for a paramagnetic system; see 
equation (3.9.12). For T less than, but close to, T c we utilize equation (14) as well and get 

XoC T) « (Nq 2 /2k)(T c - T)' 1 (T <T C ,B -+ 0). (22b) 

Experimentally, one finds that the Curie-Weiss law is satisfied with considerable accuracy, 
except that the empirical value of T c thus obtained is always somewhat larger than the true 
transition temperature of the material; for instance, in the case of nickel, the empirical 
value of T c obtained in this manner turns out to be about 650 K while the actual transition 
takes place at about 631 K. In passing, we add that, as T -+ 0, the low-held susceptibility 
vanishes, in accordance with the formula 

4Nu 2 

Xo(T) ~ -j£- exp(—2 T c /T). (23) 

Finally, we examine the relationship between L and B at T = T c . Using, once again, 
equation (10) and employing the approximation tanh -1 x ~ x + x 3 /3, we get 

L ~ (3qB/kT c ) 1/3 (T = T C ,B -> 0). (24) 

At this point we wish to emphasize the remarkable similarity that exists between the 
critical behavior of a gas-liquid system obeying van der Waals equation of state and that 
of a magnetic system treated in the Bragg-Williams approximation. Even though the two 
systems are physically very different, the level of approximation is such that the exponents 
governing power-law behavior of the various physical quantities in the critical region turn 
out to be the same; compare, for instance, equation (14) with (12.2.11), equations (22a) and 
(22b) with (12.2.13) and (12.2.14), equation (24) with (12.2.8) — along with the behavior of 
the specific heat as well. This sort of similarity will be seen again and again whenever we 
employ an approach similar in spirit to the mean field approach of this section. 

Before we close our discussion of the so-called zeroth approximation, we would like to 
demonstrate that it corresponds exactly to the random mixing approximation (which was 
employed originally in the theory ofbinary solutions). According to equation (12.3.19), the 
mean configurational energy in the absence of the external field is given by 

U 0 = -j(yiN~ 2 qN+ + 4A++J. (25) 

At the same time, equations (2) and (16) of the present approach give 

N+ = ilV(l+Zo) and U 0 =-^qJNL 2 Q . (26) 

Combining (25) and (26), we obtain 

N++ = ^qN(l + I 0 ) 2 , 


(27) 
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so that 


N++ (N+\ 2 

\ qN \n) • 


(28) 


Thus, the probability of having an “up-up” nearest-neighbor pair of spins in the lat¬ 
tice is precisely equal to the square of the probability of having an “up” spin; in other 
words, there does not exist, in spite of the presence of a nearest-neighbor interaction 
(characterized by the coupling constant /), any specific correlation between the neigh¬ 
boring spins of the lattice. Put differently, there does not exist any short-range order 
in the system, apart from what follows statistically from the long-range order (charac¬ 
terized by the parameter L). It follows that, in the present approximation, our system 
consists of a specific number of “up” spins, namely N(l + L)/2, and a corresponding 
number of “down” spins, namely N(l— L)/2, distributed over the N lattice sites in a 
completely random manner — similar to the mixing of N( \ + L)/2 atoms of one kind 
with N(l — L)/2 atoms of another kind in a completely random manner to obtain a 
binary solution of N atoms; see also Problem 12.4. For this sort of mixing, we obviously 
have 


N ++ = l -qN 




N + _ = 2--qN\ 


¥)(¥)■ 


with the result that 


A4+Ak_ _ 1 
(N+-) 2 ~4 ' 


(29a) 

(29b) 


(30) 


12.6 Ising model in the first approximation 

The approaches considered in the preceding section have a natural generalization toward 
an improved approximation. The mean field approach leads naturally to the Bethe 
approximation (Bethe, 1935; Rushbrooke, 1938), which treats the interaction of a given 
spin with its nearest neighbors somewhat more accurately. The random mixing approach, 
on the other hand, leads to the quasichemical approximation (Guggenheim, 1935; Fowler 
and Guggenheim, 1940), which takes into account the specific short-range order of the lat¬ 
tice — over and above the one that follows statistically from the long-range order. As shown 
by Guggenheim (1938) and by Chang (1939), the two methods yield identical results for the 
Ising model. It seems worthwhile to mention here that the extension of these approxima¬ 
tions to higher orders, or their application to the Heisenberg model, does not produce 
identical results. 
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In the Bethe approximation, a given spin oq is regarded as the central member of a 
group, which consists of this spin and its q nearest neighbors, and in writing down the 
Hamiltonian of this group the interaction between the central spin and its q neighbors is 
taken into account exactly while the interaction of these neighbors with other spins in the 
lattice is taken into account through a mean molecular field B'. Thus 

q q 

H q+ 1 = -hBoq- /r(B + 5 , )E 0 )'-/E <T ° cr /’ (1) 

1 = 1 7=1 

B being the external magnetic held acting on the lattice. The internal held B' is determined 
by the condition of self-consistency, which requires that the mean value, a 0 , of the central 
spin be the same as the mean value, Wj, of any of the q neighbors. The partition function Z 
of this group of spins as a whole is given by 


Z = J2 ex P 

cr 0 ,Oj=±l 


1 

IcT 


q q 

hBoq + n(B + B')y^ai+jy^0 O ai 

7=1 7=1 


= E exp 

CT o>°7 =± i 


q q 

ao 0 + (a + a') E a j + Y E CT °°7 
7=1 7=1 


( 2 ) 


where 


a = 


fiB 

IcT’ 



and 



(3) 


Now, the right side of (2) can be written as a sum of two terms, one pertaining to cto = +1 
and the other to a 0 = — 1, that is, 


Z = Z++Z_, 


where 


Z± = E ex P 


r;=±l 


± 0 ! + (a + a' ± y ) E ] c 


7=1 


= e ±a [2cosh(a + </ ± y)] q ■ 

The mean value of the central spin is then given by 


Z+-Z_ 


(4) 


cr 0 = 


z 


(5) 
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while the mean value of any one of its q neighbors is given by, see (2) and (4), 



= — [Z + tanh (a + a' + y) + Z_ tanh (o t + a' — y)}. ( 6 ) 

Equating (5) and ( 6 ), we get 


Z+ {1 — tanh (a + a' + y )} = Z_ {1 + tanh (a + a' — y) }. (7) 

Substituting for Z + and Z_ from (4), we finally obtain 

e 2a' = j cosh (a + a 1 + y) 

I cosh (a + a' — y) 

Equation (8) determines a.' which, in turn, determines the magnetic behavior of the lattice. 

To study the possibility of spontaneous magnetization, we set a(— jiB/kT ) = 0. Equa¬ 
tion (8) then reduces to 



a 


q- 1 , f cosh(a' + y) 

-ini--- 

2 cosh {a' — y) 


0 ) 


In the absence of interactions (y = 0), a' is clearly zero. In the presence of interactions 
(y/ 0), a' may still be zero unless y exceeds a certain critical value, y c say. To determine 
this value, we expand the right side of (9) as a Taylor series around a r — 0, with the result 

a! = (q— l)tanhy I a' — sech 2 y H-i. (10) 


We note that, for all y, a' — 0 is one possible solution of the problem; this, however, does 
not interest us. A nonzero solution requires that 


C q — l)tanh y > 1 , 


that is, 


y > y c = tanh 


-l 


q-1 


= - In 
2 



In terms of temperature, this means that 


( 11 ) 




(12) 
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which determines the Curie temperature of the lattice. From (10), we also infer that for 
temperatures less than, but close to, the Curie temperature 


a' — 13cosh 2 Yc[(c/ — 1) tanhy — l] j 7 ~ {3(q -l)(y - y c )} 1/2 

H3) 


The parameter L, which is a measure of the long-range order in the system, is, by definition, 
equal to a. From equations (5) and (7), we get 

z _ (Z+/ZL) - 1 _ sinh(2« + 2a') 

(Z+/Z_) + 1 cosh(2a + 2a’) + exp(— 2y) ’ 1 

In the limit B -»• 0 (which means a —>• 0) and at temperatures less than, but close to, the 
Curie temperature (y > y c ;a' ~ 0), we obtain 


Lq 


sinh(2a') 

cosh(2o'0 + exp(—2y) 


2a' 

1 + (<7 — 2 )/q 


q 

q-l 


Substituting from (12) and (13), we get 


Lq — 





(15) 


(16) 


We note that, for q » 1, equations (12) and (16) reduce to their zeroth-order counter parts 
(12.5.13) and (12.5.14), respectively; in either case, as T -»• T c from below, L 0 vanishes as 
( T c — T) 1 / 2 . We also note that the spontaneous magnetization curve in the present approxi¬ 
mation has the same general shape as in the zeroth approximation; see Figure 12.7. Of 
course, in the present case the curve depends explicitly on the coordination number q, 
being steepest for small q and becoming less steep as q increases — tending ultimately to 
the limiting form given by the zeroth approximation. 

We shall now study correlations that might exist among neighboring spins in the lattice. 

For this, we evaluate the numbers N ++t N _, and N _in terms of the parameters a, a', 

and y, and compare the resulting expressions with the ones obtained under the mean field 
approximation. Carrying out summations in (2) over all the spins (of the group) except &o 
and cri, we obtain 


Z = 


2 - 


oo>°l 


=±1 


L ex p{ 


aero + {a ■ 




(17) 


Writing this as a sum of three parts pertaining, respectively, to the cases (i) ao = o\ = +1, 
(ii) ctq = cr\ = —1, and (iii) go = —o\ = ±1, we have 


Z — + Z — - 1 - Z- 1 _. 


(18) 
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where, naturally enough, 

N ++ : : N+_ :: Z++ : Z__ : Z+_. 

We thus obtain, using (8) as well, 

N ++ oc e (2 “ + “ ,+y ) {2cosh(a + a' 4- y)} ql , 

N _oc e (- 2Q, -“'+y) {2cosh(a + a' — y)} c/ 1 

= e (- 2 °'- 3 “' , +y){2cosh(a + a' + y) } q ~ 1 , 


and 


N-f _oc e ( “ y) {2cosh(a + </ + y)}^ 1 + e ( " y) {2cosh(a + a' — y)] cl 1 

= 2e ( ~ a ~ y> {2cosh(a + a' + 


(19) 


Normalizing these expressions with the help of the relationship 


N ++ +N__+N+^ = ^qN, 


we obtain the desired results 


_____ , i e 2a+2a'+y e ~2a-2a'+y 2e~ Y \ 

(N ++ ,N _,iV_i_) = -qN—, -----r 2 -, 

v ’ 2^ 2{ercosh(2a + 2a / ) + e-r} 


whereby 


( 20 ) 


( 21 ) 


N ++ N — = I e 4y = I e 4//tr_ (22) 

( N +-) 2 4 4 

The last result differs significantly from the one that followed from the random mixing 
approximation, namely (12.5.30). The difference lies in the extra factor exp(4//fcT) which, 
for/ > 0, favors the formation of parallel-spin pairs f f and f/, as opposed to antiparallel- 
spin pairs // and / /.In fact, one may regard the elementary process 

tt + wo 2fl, (23) 

which leaves the total numbers of “up” spins and “down” spins unaltered, as a kind of 
a “chemical reaction” which, proceeding from left to right, is endothermic (requiring an 
amount of energy 4/ to get through) and, proceeding from right to left, is exothermic 
(releasing an amount of energy 4/). Equation (22) then constitutes the law of mass action 
for this reaction, the expression on the right side being the equilibrium constant of the 
reaction. Historically, equation (22) was adopted by Guggenheim as the starting point 
of his “quasichemical” treatment of the Ising model; only later on did he show that his 
treatment was equivalent to the Bethe approximation expounded here. 
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Equation (22) tells us that, for / > 0, there exists among like neighbors (t and f or f 
and J,) a positive correlation and among unlike neighbors (f and f) a negative correlation, 
and that these correlations are a direct consequence of the nearest-neighbor interaction. 
Accordingly, there must exist a specific short-range order in the system, over and above the 
one that follows statistically from the long-range order. To see this explicitly, we note that 
even when long-range order disappears (a + a' = 0), some short-range order still persists. 
For instance, from equation (21) we obtain 


(N++,N_ 


N+-) 


1=0 


1 (eY,eY,2e~Y) 

-qN-—-—— - 

2 4 coshy 


(24) 


which, only in the limit y -> 0, goes over to the random-mixing result, see equa¬ 
tion (12.5.29) with L = 0, 


(AT ++ ,AC_,A4_) Z=0 = \qN^. (25) 

In the zeroth approximation, equation (25) is supposed to hold at all temperatures above 
T c ; we now find that a better approximation at these temperatures is provided by (24). 

Next, we evaluate the configurational energy Uo and the specific heat Co of the lattice 
in the absence of the external field (a — 0). In view of equation (12.5.25), 



1 


-qN- 2qN + + 4N ++ 


*=o 


(26) 


The expression for N ++ is given by equation (21) while that for N + can be obtained 
from (14): 


(N + ) a= o = — JV(1 + Lq) = —N 


exp(2a') + exp(—2y) 
cosh(2o'0 + exp(—2 y) ’ 


Equation (26) then gives 


(27) 


1 cosh(2<*') — exp(—2 y) 

2 q/N cosh( 2 a /) _|_ e xp(— 2 y) ’ 


(28) 


where a' is determined by equation (9) . For T > T c , a' = 0, so 



1 - exp(-2y) 
1 + exp(—2 y) 



qJN tanh y. 


(29) 


Obviously, this result arises solely from the short-range order that persists in the system 
even above T c . As for the specific heat, we get 


C 0 /Nk=^qy 2 sech 2 y (T > T c ). (30) 

As T -y oo, Co vanishes like T~ 2 . We note that a nonzero specific heat above the transition 
temperature is a welcome feature of the present approximation, for it brings our model 
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FIGURE 12.9 The field-free specific heat of an Ising lattice with coordination number 4. Curve 1 obtains in the 
Bethe approximation, curve 2 in the Bragg-Williams approximation. 

somewhat closer to real physical systems. In this connection, we recall that in the previous 
approximation the specific heat was zero for all T > T c . Figure 12.9 shows the specific heat 
of an Ising lattice, with coordination number 4, as given by the Bethe approximation; for 
comparison, the result of the previous approximation is also included. 

We are now in a position to study the specific heat discontinuity at T — T c . The limiting 
value of Co, as T approaches T c from above, can be obtained from equation (30) by letting 
Y -*■ y c ■ One obtains, with the help of equation (11), 



(31) 


To obtain the corresponding result as T approaches T c from below, we must use the general 
expression (28) for Uq, with «' —»• 0 as y —»■ y c . Expanding (28) in powers of the quantities 
(y - Yc) and a', and making use of equation (13), we obtain for (1 - T/T c ) < 1 



a'* + ■ • • 



(32) 


Differentiating with respect to T and substituting for T c , we obtain 



(33) 


which is (3q-2) times larger than the corresponding result for T=T C +; compare 
with equation (31). The specific-heat discontinuity at the transition point is, therefore, 
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given by 



2 


(34) 


One may check that, for q ;» 1, the foregoing results go over to the ones following from the 
zeroth approximation. 

Finally, we examine the relationship between L and B at T = T c . Using equations (8) and 
(14), with both a and a'«l while y — y c , we get 



(35) 


compare with equation (12.5.24). For the behavior of /o» see Problem 12.16. 

In passing, we note that, according to equation (12), the transition temperature for a 
lattice with q = 2 is zero, which essentially means that a one-dimensional Ising chain does 
not undergo a phase transition. This result is in complete agreement with the one follow¬ 
ing from an exact treatment of the one-dimensional lattice; see Section 13.2. In fact, for 
a lattice with q — 2, any results following from the Bethe approximation are completely 
identical with the corresponding exact results (see Problem 13.3); on the other hand, the 
Bragg-Williams approximation is least reliable when q — 2. 

That T c for q — 2 is zero (rather than 2 J/k) is in line with the fact that, for all q, the first 
approximation yields a transition temperature closer to the correct value of T c than does 
the zeroth approximation. The same is true of the amplitudes that determine the quan¬ 
titative behavior of the various physical quantities near T = T c , though the exponents in 
the various power laws governing this behavior remain the same; compare, for instance, 
equation (16) with (12.5.14), equation (35) with (24) as well as the behavior of the spe¬ 
cific heat. In fact, one finds that successive approximations of the mean field approach, 
while continuing to improve the theoretical value of T c and the quantitative behavior of 
the various physical quantities (as given by the amplitudes), do not modify their quali¬ 
tative behavior (as determined by the exponents). For an account of the higher-order 
approximations, see Domb (1960). 

One important virtue of the Bethe approximation is that it brings out the role of the 
dimensionality of the lattice in bringing about a phase transition in the system. The fact 
that T c — 0 for q — 2 and thereon it increases steadily with q leads one to infer that, while a 
linear Ising chain does not undergo phase transition at any finite T, higher dimensionality 
does promote the phenomenon. One may, in fact, argue that the absence of a phase tran¬ 
sition in a one-dimensional chain is essentially due to the fact that, the interactions being 
severely short-ranged, “communication” between any two parts of the chain can be com¬ 
pletely disrupted by a single defect in-between. The situation remains virtually unaltered 
even if the range of interactions is allowed to increase — so long as it remains finite. Only 
when interactions become truly long-ranged, with Jij ~ \ i - j\~ (l+cr) (o > 0), does a phase 
transition at a finite T become possible — but only if o < 1; for o > 1, we are back to the 
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case of no phase transition, while the borderline case a = 1 remains in doubt. For more 
details, see Griffiths (1972, pp. 89-94). 

Peierls (1936) was the first to demonstrate that at sufficiently low temperatures the fsing 
model in two or three dimensions must exhibit a phase transition. He considered the lat¬ 
tice as made up of two kinds of domains, one consisting of “up” spins and the other of 
“down” spins, separated by a set of boundaries between the neighboring domains, and 
argued on energy considerations that in a two- or three-dimensional lattice the long-range 
order that exists at OK would persist at finite temperatures. Again, for details, see Griffiths 
(1972, pp. 59-66). 


12.7 The critical exponents 

A basic problem in the theory of phase transitions is to study the behavior of a given system 
in the neighborhood of its critical point. We know that this behavior is marked by the fact 
that the various physical quantities pertaining to the system possess singularities at the 
critical point. It is customary to express these singularities in terms of power laws char¬ 
acterized by a set of critical exponents that determine the qualitative nature of the critical 
behavior of the given system. To begin with, we identify an order parameter m, and the cor¬ 
responding ordering field h, such that, in the limit h -*■ 0, m tends to a limiting value m 0 , 
with the property that m 0 — 0 for T > T c and / 0 for T < T c . For a magnetic system, the nat¬ 
ural candidate for m is the parameter L(= a) of Sections 12.5 and 12.6, while h is identified 
with the quantity pB/kT c \ for a gas-liquid system, one may adopt the density differential 
(Pi ~ Pc) or | p g - p c \ for m and the pressure differential (P - P c ) for h. The various critical 
exponents are then defined as follows. 

The manner in which mo ->• 0, as T ->• T c from below, defines the exponent ft: 

mo ~ (T c — T)P (h^0,T<T c ). (1) 

The manner in which the low-field susceptibility /a diverges, as T —»• T c from above (or 
from below), defines the exponent y (or y'): 

/ dm\ j (T-T c r y (h^0,T>T c ) (2a) 

\3hJ T h _^ 0 [ (7c _ T) -y' (/j o, T < T c ); (2b) 

in the gas-liquid transition, the role of xo is played by the isothermal compressibility, 
kt — p~ 1 (dp/dP)r, of the system. Next, we define an exponent S by the relation 

m\ T=Tc ~h lls (T = T Ct h 0); (3) 

in the case of a gas-liquid system, <5 is a measure of the “degree of flatness” of the critical 
isotherm at the critical point, for then 


IF-Pci 


T=T C 


\p — Pel " 5 (T — T C ,PP c ). 


(4) 
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Finally, we define exponents a and a' on the basis of the specific heat, Cy, of the gas-liquid 
system: 


UT-T c r a [T > T c ) (5a) 

V 1 (Tc-T)- 01 ' (T < T c ). (5b) 

In connection with the foregoing relations, especially equations (5) , we wish to empha¬ 
size that in certain cases the exponent in question is rather small in value; it is then more 
appropriate to write 

m-* _ i 

— (in«i). (6) 

A 

Now, if k > 0, the function/(f) would have a power-law divergence at t — 0; in case k -* 0, 
the function/(£) would have a logarithmic divergence instead: 

/(f)~ln(l/|f|) (|f|« 1). (7) 

In either case, the derivative/'(f) ~ 

A survey of the results derived in Sections 12.2 through 12.6 shows that for a gas-liquid 
system obeying van der Waals equation of state or for a magnetic system treated in the 
mean field approximation (it does not matter what order of approximation one is talking 
about), the various critical exponents are the same: 

1 

/?=-, y = Y =1, 5 = 3, a = a' = 0. (8) 

In Table 12.1 we have compiled experimental data on critical exponents pertaining 
to a variety of systems including the ones mentioned above; for completeness, we 
have included here data on another two exponents, v and which will be defined in 
Section 12.12. We find that, while the observed values of an exponent, say p, differ very 
little as one goes from system to system within a given category (or even from category 
to category), these values are considerably different from the ones following from the 
mean field approximation. Clearly, we need a theory of phase transitions that is basically 
different from the mean field theory. 

To begin with, some questions arise: 

(i) Are these exponents completely independent of one another or are they mutually 
related? In the latter case, how many of them are truly independent? 

(i i) On what characteristics of the given system do they depend? This includes the 
question why, for systems differing so much from one another, they differ so little. 

(i i i) How can they be evaluated from first principles? 

The answer to question (i) is simple: yes, the various exponents do obey certain rela¬ 
tions and hence are not completely independent. These relations appear in the form 
of inequalities, dictated by the principles of thermodynamics, which will be explored 
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Table 12.1 Experimental Data on Critical Exponents 


Critical 

Exponents 

Magnetic 

Systems* 3 * 

Gas-liquid 

Systems* 13 ) 

Binary Fluid 
Mixtures** 1 ) 

Binary 

Alloys* d ) 

Ferroelectric 

Systems* e ) 

Superfluid 

He 4 **) 

Mean Field 

Results 

a,a' 

0.0-0.2 

0.1-0.2 

0.05-0.15 

— 

— 

-0.026 

0 

p 

0.30-0.36 

0.32-0.35 

0.30-0.34 

0.305 ± 0.005 

0.33-0.34 

— 

1/2 

y 

1.2-1.4 

1.2-1.3 

1.2-1.4 

1.24 ± 0.015 

1.0 ± 0.2 

inaccessible 

1 

/ 

1.0-1.2 

1.1-1.2 

— 

1.23 ± 0.025 

1.23 ± 0.02 

inaccessible 

1 

s 

4.2-4.8 

4.6-5.0 

4.0-5.0 

— 

— 

inaccessible 

3 

V 

0.62-0.68 

— 

— 

0.65 ± 0.02 

0.5-0.8 

0.675 

1/2 

V 

0.03-0.15 

— 

— 

0.03-0.06 

— 

— 

0 


* a )Stierstadt et at (1990). 

^Voronel (1976); Rowlinson and Swinton (1982). 
WRowlinson and Swinton (1982). 

WAls-Nielsen (1976); data pertain to beta-brass only. 
^Kadanoff et at (1967); Lines and Glass (1977). 
( f >Ahlers(1980). 


in Section 12.8; in the modern theory of phase transitions, see Sections 12.10 through 
12.12 and Chapter 14, the same relations turn up as equalities, and the number of these 
(restrictive) relations is such that, in most cases only two of the exponents are truly 
independent. 

As regards question (ii), it turns out that our exponents depend on a very small number 
of characteristics, or parameters, of the problem, which explains why they differ so little 
from one system to another in a given category of systems (and also from one category 
to another, even though systems in those categories are so different from one another). 
The characteristics that seem to matter are (a) the dimensionality, d, of the space in which 
the system is embedded, (b) the number of components, n, of the order parameter of the 
problem, and (c) the range of microscopic interactions in the system. 

Insofar as interactions are concerned, all that matters is whether they are short-ranged 
(which includes the special case of nearest-neighbor interactions) or long-ranged. In the 
former case, the values of the critical exponents resulting from nearest-neighbor interac¬ 
tions remain unaltered — regardless of whether further-neighbor interactions are included 
or not; in the latter case, assuming Jq ~ \i—j\~ (d+a ' ) with a > 0, the critical exponents 
depend on o. Unless a statement is made to the contrary, the microscopic interactions 
operating in the given system will be assumed to be short-ranged; the critical exponents 
then depend only on d and n — both of which, for instructional purposes, may be treated 
as continuous variables. 

Insofar as d is concerned, we recall the Bethe approximation that highlighted the 
special role played by the dimensionality of the lattice through, and only through, the coor¬ 
dination number q. We also recall that, while the theoretical value of T c and the various 
amplitudes of the problem were influenced by q, the critical exponents were not. In more 
accurate theories we find that the critical exponents depend more directly on d and only 






438 Chapter 12 • Phase Transitions: Criticality, Universality, and Scaling 


indirectly on q; however, for a given d, they do not depend on the structural details of the 
lattice (including the number q ). 

Insofar as n is concerned, the major difference lies between the Ising model in — 1) 
with discrete symmetry (eq = +1 or -1) and other models in > 2) with continuous symme¬ 
try (-1 < &ia < +1 f° r a = 1,..., n, with <r ; j = 1). In the former case, T c is zero for d < 1 and 
nonzero for d > 1; in the latter, T c is zero for d < 2 and nonzero for d > 2. 13 In either case, 
the critical exponents depend on both d and n, except that for d > 4 they become indepen¬ 
dent of d and n, and assume values identical to the ones given by the mean field theory; 
the physical reason behind this overwhelming generality is examined in Section 12.13. In 
passing, we note that, for given d and n, the critical exponents do not depend on whether 
the spins constituting the system are treated classically or quantum-mechanically. 

As regards question (iii), the obvious procedure for evaluating the critical exponents 
is to carry out exact (or almost exact) analysis of the various models — a task to which 
the whole of Chapter 13 is devoted. An alternative approach is provided by the renormal¬ 
ization group theory, which is discussed in Chapter 14. A modest attempt to evaluate the 
critical exponents is made in Section 12.9, which yields results that are inconsistent with 
the experiment but teaches us quite a few lessons about the shortcomings of the so-called 
classical approaches. 


12.8 Thermodynamic inequalities 


The first rigorous relation linking critical exponents was derived by Rushbrooke (1963) 
who, on thermodynamic grounds, showed that for any physical system undergoing a phase 
transition 


(a' + 2/S + y') > 2. 


( 1 ) 


The proof of inequality (1) is straightforward if one adopts a magnetic system as an exam¬ 
ple. We start with the thermodynamic formula for the difference between the specific heat 
at constant field Ch and the specific heat at constant magnetization Cm (see Problem 3.40) 



( 2 ) 


Since Cm > 0, it follows that 



(3) 


Now, letting H -> 0 and T T c from below, we get 

Di(T c - T)~ a ' > D 2 T c (T c - D>'' +2( ^- 1) 


(4) 


13 The special case n = 2 with d = 2 is qualitatively different from others; for details, see Section 13.7. 
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where D\ andD 2 are positive constants; here, use has been made of power laws (12.7.1, 2b, 
and 5b). 14 Inequality (4) may as well be written as 

(T c - r) 2_( “' +2/s+/) > D 2 TJD l . (5) 


Since ( T c —T) can be made as small as we like, (5) will not hold if ( a' + 2+ y') < 2. The 
Rushbrooke inequality (1) is thus established. 

To establish further inequalities, one utilizes the convexity properties of the Helmholtz 
free energy A(T,M). Since dA = —SdT + HdM, 


and 






(6a, b) 



a 2 h\ _ / dH\ _ 1 > 
dM^) T ~ {dMj T ~ X ~ ‘ 


(7a, b) 


It follows that A(T,M) is concave in T and convex in M. We now proceed to establish the 
Griffiths inequality (1965a, b) 


a' + 0(5+1) >2. (8) 

Consider a magnetic system in zero field and at a temperature T\ < T c . Then, by (7a), 
A{T,M) is a function of T only, so we can write 


A{T\,M) = A(7i,0) (-Mi<M<Mi), (9) 

where Mi is the spontaneous magnetization at temperature Ti; see Figure 12.10. Applying 
(6a) to (9), we get 


S{T\,M) = S(7i,0) (—Mi < M < Mi). 


( 10 ) 


We now define two new functions 

A* (T,M) = [A{T,M) —A c } + (T— T C )S C (11) 


and 


S*(T,M) = S(T,M) — S c , (12) 

14 Recalling the correspondence between a gas-liquid system and a magnet, one might wonder why we have 
employed Ch, rather than Cm, in place of Cv■ The reason is that, since we are letting H -> 0 and T -*■ T C —,M -»■ 0 as 
well. So, as argued by Fisher (1967), in the limit considered here, Ch and Cm display the same singular behavior. In fact, 
it can be shown that if the ratio Cm/Ch —>■ 1 as T ->■ T c , then (a' + 2/3 + y') must be greater than 2; on the other hand, if 
this ratio tends to a value less than 1, then ( a 1 + 2/1 + y') = 2. For details, see Stanley (1971), Section 4.1. 
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M 



FIGURE 12.10 Magnetization, M(T,H), of a magnetic system for H > 0, H < 0, and H ->■ 0. Here, Mi denotes the 
spontaneous magnetization of the system at a temperature Ti < T c . 


where A c — A(T c , 0) and S c — S(T C , 0). It follows that 



3 2 A* \ 

3 T 2 J 

/ M 




(13a, b) 


Thus, A* is also concave in T. Geometrically, this means that, for any choice of T \, the curve 
A*(T), withM fixed at Mi, lies below the tangent line at T = T\, that is, 


A*(T,M\) <A*(Ti,Mi) + (- 7 ^) (T-Ti); 

\ / Mi,T=Ti 

see Figure 12.11. Letting T = T C in (14), we get 


(14) 


A*(T c ,Mi) S A*(Ti,M\) - S*(T lt Mi)(T c - Ti) 


(15) 


which, in view of equations (9) through (12), may be written as 

A^r c ,M 1 )<A^r 1 ,o)-^(ri,o)(r c -r 1 ). (16) 

Utilizing, once again, the concavity of the function A*(T) but this time at T — T c (with M 
fixed at zero and the slope ( dA*/8T ) vanishing), we get, see (14), 

A*(T,0) <A*(T c ,0). (17) 

Now, letting T — Ti in (17) and noting that A*(T c ,0) = 0 by definition, we get 


A*(7i,0) < 0. 


(18) 
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FIGURE 12.11 The function A*(T,M) of a magnetic system, with magnetization M fixed at Mi. The slope of this 
curve is S(T C , 0) — S(T,Mi), which is positive for all T < T c . 


Combining (16) and (18), we finally get 


A*(T c ,Mi) < ~(T C - T^iTuO), 


(19) 


valid for all T\ < T c . 

The next step is straightforward. We let T\ -+ T c — t so that M\ —> 0 and along with it 


A\T c ,Mi) 


Mi 

/ 


HdM 


iDM\ 


■5+1 


J T=T C 


D'(T c - , 


( 20 ) 


while 


C*rT rn [ C(T,0) , D" vl-a' 

S*(Ti,0)= / —-— dT^ - — (Tc-TxY (21) 

r c 

where D, D and D" are positive constants; here, use has been made of power laws (12.7.1, 
3, and 5b). Substituting (20) and (21) into (19), we get 

(Tc - Ti )2-“'-/*(5+i) >DlT c /D". (22) 

Again, since ( T c — Ti) can be made as small as we like, (22) will not hold if a' + + 1) < 2. 

The Griffiths inequality (8) is thus established. It will be noted that unlike the Rushbrooke 
inequality, which related critical exponents pertaining only to T < T c , the present inequal¬ 
ity relates two such exponents, a' and /), with one, namely S, that pertains to the critical 
isotherm (T = T c ). 

While inequalities (1) and (8) are thermodynamically exact, Griffiths has derived several 
others that require certain plausible assumptions on the system in question. We quote two 
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of them here, without proof: 


/>,B(S-l) (23) 

y > (2 — a)(S — l)/(<5 + 1). (24) 

For a complete list of such inequalities, see Griffiths (1972), p. 102, where references to 
original papers are also given. 

Before proceeding further, the reader may like to verify that the experimental data on 
critical exponents, as given earlier in Table 12.1, do indeed conform to the inequalities 
proved or quoted in this section. It is important in this connection to note that the mean 
field exponents (a — a' — 0, f = 1/2, y = y’ = 1, and S = 3) satisfy all these relations as 
equalities. 


12.9 Landau's phenomenological theory 

As early as 1937 Landau attempted a unified description of all second-order phase transi¬ 
tions — second-order in the sense that the second derivatives of the free energy, namely 
the specific heat and the magnetic susceptibility (or isothermal compressibility, in the 
case of fluids), show a divergence while the first derivatives, namely the entropy and 
the magnetization (or density, in the case of fluids), are continuous at the critical point. 
He emphasized the importance of an order parameter mo (which would be zero on the 
high-temperature side of the transition and nonzero on the low-temperature side) and 
suggested that the basic features of the critical behavior of a given system may be deter¬ 
mined by expanding its free energy in powers of mo (for we know that, in the close vicinity 
of the critical point, mo <k( 1). He also argued that in the absence of the ordering field (h = 0) 
the up-down symmetry of the system would require that the proposed expansion contain 
only even powers of mo. Thus, the zero-field free energy fo(— Ao/NkT ) of the system may 
be written as 


i h(t, mo) = q(t) + r(t)ml + s{t)ml + • 


t= ,\t\ « 11 j 


at the same time, the coefficients q(t), r(t), s(t)... may be written as 


q(t) = J2 c lkt k , r(t)=^r k t k , s(t) = ■ 

k> 0 k> 0 k> 0 


( 1 ) 


( 2 ) 


The equilibrium value of the order parameter is then determined by minimizing fo 
with respect to mo; retaining terms only up to the order displayed in (1), which for 
thermodynamic stability requires that s(t) > 0, we obtain 


r(t)mo +2s(t)niQ = 0. 


(3) 
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The equilibrium value of mo is thus either 0 or ±^/[-r(t)/2s(t)\. The first solution is of 
lesser interest, though this is the only one we will have for t > 0; it is the other solutions that 
lead to the possibility of spontaneous magnetization in the system. To obtain physically 
sensible results, see equations (9) through (11), we musthave in equation (2): ro = 0, n > 0, 
and sq > 0, with the result 


\m 0 \^l(ri/2s 0 )\t\\ 1/2 (t< 0), 


(4) 


giving p = 1/2. 

The asymptotic expression for the free energy, namely 

i/o(t,m 0 )^qo + ritml+s 0 m% (r 1 ,s 0 >0), (5) 

is plotted in Figure 12.12. We see that, for t > 0, there is only one minimum, which is located 
at mo — 0; for t = 0, the minimum is rather flat. For t <0, on the other hand, we have two 
minima, located at mo = ±m s , as given by expression (4), with a maximum at mo = 0. Now, 
since fo has to be convex in mo, so that the susceptibility of the system be nonnegative, 
see equation (12.8.7b), we must replace the nonconvex portion of the curve, which lies 
between the points mo = —m s and mo — +m s , by a straight line (along which the suscep¬ 
tibility would be infinite). This replacement is reminiscent of the Maxwell construction 
employed in Sections 12.1 and 12.2. 

We now subject the system to an ordering field h, assumed positive. If the field is weak, 
the only change in the expression for the free energy would be the addition of a term -hm. 
Disregarding the appearance of any higher powers of (hm) as well as any modifications of 


<A 0 ( f ’ m o> 



FIGURE 12.12 The free energy of the Landau theory, shown as a function of mo, for three different values 

of t. The dashed curve depicts spontaneous magnetization m s (f), while the horizontal line for t < 0 provides the 
Maxwell construction. 
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the coefficients already present, we may now write 

m ) = —hm + q(t) + r(t)m 2 + s(t)m 4 . ( 6 ) 

The equilibrium value of m is now given by 15 

—h + 2r(t)m +4s{t)m 3 = 0. (7) 

The low-field susceptibility of the system, in units of Np 2 /kT, thus turns out to be 


x = 


3h\ 


-l 


1 


( 8 ) 


dm) t 2r(t) + \2s(t)m 2 ' 

valid in the limit h -» 0. Now, for t > 0, m -> 0 and we get 

x»l/2nr (f>0), (9) 

giving y = f. On the other hand, for t < 0, m-+ s/[{r\/2so)\t\], see (4); we then get 

x«l/4n|f| (r<0), ( 10 ) 

giving y’ — 1. Finally, if we set t = 0 in (7), we obtain the following relation between h 
and m: 


h rb 45q m 3 


(h -»■ 0), 


( 11 ) 


giving 8 = 3. 

We shall now look at the specific heats C;, and C m . If t > 0, then h -+ 0 implies m -»• 0, 
so in this limit there is no difference between Ch and C m . Equation (1) then gives, in units 
of Nk, 

1 3 2 V f o\ \ 

c h = C m = -l -^ L ) =-(2q 2 +6q 3 t+---) (t > 0). (12) 

V / m—^0 

For t < 0, we have 


Cm — ~ 


[(2*72 + 6 f 731 +■■•) + (2r 2 + • ■ ■ )m 2 -j 


= - [ 2 q 2 + { 6^3 - (rir 2 /s 0 )}t + ...] (t < 0 ). 
Next, using equation (12.8.2) along with (4) and (10), we have 


Ch — C m — 


3 h 
dm 


3 m\ 
dt )h 


' (f<0). 

25q 


(13) 


(14) 


15 It may be mentioned here that the passage from equation (1) to (6) is equivalent to effecting a Legendre transfor¬ 
mation from the Helmholtz free energyA to the Gibbs free energy G (=A-HM), and equation (7) is analogous to the 
relation (M/3M)r = H. 
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It follows that, while C m possesses a cusp-like singularity at t = 0, C;, undergoes a jump 
discontinuity of magnitude 

(C fc ) t -o- - (Cfc)^.o+ « rf/2s 0 . (15) 

It follows that a = a' = 0. 

The most striking feature of the Landau theory is that it gives exactly the same critical 
exponents as the mean field theory of Sections 12.5 and 12.6 (or the van der Waals theory 
of Section 12.2). Actually it goes much further, for it starts with an expression for the free 
energy of the system containing parameters q^, r^.s ^,..., which represent the structure of 
the given system and the interactions operating in it, and goes on to show that, while the 
amplitudes of the various physical quantities near the critical point do depend on these 
parameters, the critical exponents do not! This universality (of critical exponents) suggests 
that we are dealing here with a class of systems which, despite their structural differences, 
display a critical behavior that is qualitatively the same for all members of the class. This 
leads to the concept of a universality class which, if Landau were right, would be a rather 
large one. The fact of the matter is that the concept of universality is very much overstated 
in Landau’s theory; in reality, there are many different universality classes — each defined 
by the parameters d and n of Section 12.7 and by the range of the microscopic interac¬ 
tions — such that the critical exponents within a class are the same while they vary from 
one class to another. The way Landau’s theory is set up, the parameter n is essentially equal 
to 1 (because the order parameter m 0 is treated as a scalar), the parameter d plays no role 
at all (though later on we shall see that the mean field exponents are, in fact, valid for all n 
if d > 4), while the microscopic interactions are implicitly long-ranged. 16 

An objection commonly raised against the Landau theory is that, knowing fully well 
that the thermodynamic functions of the given system are going to be singular at t = 0, a 
Taylor-type expansion of the free energy around m = 0 is patently a wrong start. While the 
objection is valid, it is worth noting how a regular function, (1) or (6), leads to an equation 
of state, (3) or (7), which yields different results for t -> 0— from the ones for t —> 0+, the 
same being true of whether h -> 0+ or 0—. The trick lies in the fact that we are not using 
equation (1) or (6) as such for all f, for t < 0, we use instead a modified form, as “corrected” 
by the Maxwell construction (see Figure 12.12). The spirit of the singularity is thereby cap¬ 
tured, though the nature of the singularity, being closely tied with the nature of the original 
expansion, could not be any different from the mean-field type. The question now arises: 
how can the Landau theory be improved so that it may provide a more satisfactory picture 
of the critical phenomena? Pending exact analyses, one wonders if some generalization of 
the Landau approach, admitting more than one universality class, would provide a bet¬ 
ter picture than the one presented so far. It turns out that the scaling approach, initiated 
by Widom (1965), by Domb and Hunter (1965), and by Patashinskii and Pokrovskii (1966), 
provided the next step in the right direction. 

16 In certain systems such as superconductors, the effective interactions (which, for instance, lead to the formation of 
Cooper pairs of electrons) are, in fact, long-ranged. The critical exponents pertaining to such systems turn out to be the 
same as one gets from the mean field theory. For details, see Tilley and Tilley (1990). 
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12.10 Scaling hypothesis for thermodynamic 
functions 

The scaling approach, which took the subject of phase transitions far beyond the mean 
field theory, emerged independently from three different sources — from Widom (1965), 
who searched for a generalization of the van der Waals equation of state that could 
accommodate nonclassical exponents; from Domb and Hunter (1965), who analyzed the 
behavior of the series expansions of higher derivatives of the free energy with respect to 
the magnetic field at the critical point of a magnetic system; and from Patashinskii and 
Pokrovskii (1966), who studied the behavior of multipoint correlation functions for the 
spins constituting a system. All three were led to the same form of a thermodynamic equa¬ 
tion of state. Subsequently, Kadanoff (1966a) suggested a scaling hypothesis from which 
not only could this equation of state be derived but one could also obtain a number of rela¬ 
tions among the critical exponents, which turned out to be equalities consistent with the 
findings of Section 12.8. This approach also made it clear why one needed only two inde¬ 
pendent numbers to describe the nature of the singularity in question; all other relevant 
numbers followed as consequences. 

To set the stage for this development, we go back to the equation of state following from 
the Landau theory, namely (12.9.7), and write it in the asymptotic form 

h(t, m) 2rifm + 4som 3 . (1) 


In view of the relationship (12.9.4), we rewrite (1) in the form 


h(t, m ) 


3/2 

—— I fl 3 / 2 
„l/2 
*0 


2 sgn(t) 


d/2 


r\' 2 m 1/2 


+ 4 


d/2 


r \' 2 m 1/2 


( 2 ) 


It follows that 


m(t, h) 


T 

d— 1 1\ b 2 x a function of 

C 

*0 


to 2 h \ 

r 3 /2 \t\ 3 i 2 J 


(3) 


and, within the context of the Landau theory, the function appearing here is universal for 
all systems conforming to this theory. In the same spirit, the relevant part of the free energy 
i //hit, m) — the part that determines the nature of the singularity — may be written in the 
form 


fh’(t, m) ■ 


■ —hm + r\ tm 2 + SQtn 4 


= M 2 

So 


d/ 2 


r 3/2 |f|3/2 


d/ 2 


r\' 2 \t \ 1/2 


d/2 


m 


r\' 2 Id 1/2 


+ sgn(t) 


d/ 2 


d/ 2 


m \ 

W 2 ) 


(4) 


+ 


4" 


(5) 
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Substituting (3) into (5), one gets 



( 6 ) 


where, again, the function appearing here is universal. As a check, we see that differentiat¬ 
ing (6) with respect to h we readily obtain (3). 

The most notable feature of the equation of state, as expressed in (3), is that, instead 
of being the usual relationship among three variables m, h, and t, it is now a relationship 
among only two variables, namely m/\t I 1 / 2 and h/\L \ 3/2 . Thus, by scaling m with hi 1 / 2 and 
h with t | 3/2 , we have effectively reduced the total number of variables by one. Similarly, 
we have replaced equation (4) by (6), which expresses the singular part of the free energy 
\[r scaled with t 2 as a function of the single variable h scaled with |t| 3 / 2 . This reduction in 
the total number of effective variables may be regarded as the first important achievement 
of the scaling approach. 

The next step consists of generalizing (6), to write 


f is \t,h) S3 F\t\ 2 ~ a f(Gh/\t\ A ) 


(7) 


where a and A are universal numbers common to all systems in the given universality 
class, f(x) is a universal function which is expected to have two different branches, f+ for 
t > 0 and/_ for t < 0, while F and G (like r\ and so) are nonuniversal parameters char¬ 
acteristic of the particular system under consideration. We expect a and A to determine 
all the critical exponents of the problem, while the amplitudes appearing in the various 
power laws will be determined by F, G, and the limiting values of the function fix) and its 
derivatives (as x tends to zero). Equation (7) constitutes the so-called scaling hypothesis, 
whose status will become much more respectable when it acquires legitimacy from the 
renormalization group theory; see Sections 14.1 and 14.3. 

First of all it should be noted that the exponent of |t|, outside the function fix) in 
equation (7), has been chosen to be (2 —a), rather than 2 of the corresponding mean 
field expression (6), so as to ensure that the specific heat singularity is correctly repro¬ 
duced. Secondly, the fact that one must not encounter any singularities as one crosses the 
critical isotherm it — 0) at nonzero values of h or m requires that the exponents on the 
high-temperature side of the critical point be the same as on the low-temperature side, 
that is, 


a = a and y' = y. 


( 8 ) 


From equation (7) it readily follows that 




0) 




448 Chapter 12 • Phase Transitions: Criticality, Universality, and Scaling 


and 


x(f,h) = ~ ( ] » —FG 2 \t\ 2 ~ a ~ 2A f'(Gh/\t\ A ). 


Letting h ->• 0, we obtain for the spontaneous magnetization 

m(l,0) kb B\t\P (t< 0), 

where 

B=-FGf!_( 0), ft = 2 — a — A, 

and for the low-field susceptibility 


x(t.o)*ifr 


C+ Ct> 0) 
C_ (1<0), 


where 

C± = -FG 2 fl (0), y = a + 2A - 2. 

Combining (12b) and (14b), we get 

A = ft + y = 2 — a — ft, 

so that 

a + 2 ft + y = 2. 

To recover 8, we write the function/'(x) of equation (9) as x^/ A g(x), so that 

m(t,h ) ks — FG (1+ ^ A) fi^ A g(G/i/|f| A ). 


(10) 


( 11 ) 


(12a, b) 


(13a) 

(13b) 


(14a, b) 


(15) 


(16) 


(17) 


Inverting (17), we can write 


|f| ks G x/A h l/A x a function of (FG (1+ ^ A> h^ A /m). (18) 

It follows that, along the critical isotherm (1 = 0), the argument of the function appearing 
in (18) would have a universal value (which makes the function vanish), with the result 
that 


m~FG a+p/A) h p/A (1 = 0). (19) 

Comparing (19) with (12.7.3), we infer that 


8 = A/fi. 


(20) 
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Combining (20) with the previous relations, namely (12b) and (15), we get 

a+ 13(8 + 1) = 2 (21) 


and 


y = 0(8-1). 


( 22 ) 


Finally, combining (21) and (22), we have 


y = (2-«)(«-!)/(« + !). 


For completeness, we write down for the specific heat C;, at h = 0 


Cf(t, 0) = 


g2^(s) 


h— >0 


— (2 — a)(l — a)F\t\~ a 


/+( 0 ) 
/-( 0 ) 


h>«) 

(f <0). 


(23) 


(24a) 

(24b) 


We thus see that the scaling hypothesis (7) leads to a number of relations among 
the critical exponents of the system, emphasizing the fact that only two of them are 
truly independent. Comparing these relations with the corresponding ones appearing 
in Section 12.8 — namely, (16), (21), (22), and (23) with (12.8.1), (12.8.8), (12.8.23), and 
(12.8.24) — we feel satisfied that they are mutually consistent, though the present ones 
are far more restrictive than the ones there. Besides exponent relations, we also obtain 
here relations among the various amplitudes of the problem; though individually these 
amplitudes are nonuniversal, certain combinations thereof turn out to be universal. For 
instance, the combination ( FC±/B 2 ), which consists of coefficients appearing in equa¬ 
tions (7), (11), and (13), is universal; see equations (12a) and (14a). The same is true of the 
ratio C+/C-. For further information on this question, see the original papers by Watson 
(1969) and a review by Privman, Hohenberg, and Aharony (1991). 

We now pose the question: why do “universality classes” exist in the first place? In 
other words, what is the reason that a large variety of systems differing widely in their 
structures should belong to a single universality class and hence have common critical 
exponents and common scaling functions? The answer lies in the role played by the corre¬ 
lations among the microscopic constituents of the system which, as T T c , become large 
enough to prevail over macroscopic distances in the system and in turn make structural 
details at the local level irrelevant. We now turn our attention to this important aspect of 
the problem. 


12.11 The role of correlations and fluctuations 

Much can be learned about criticality by scattering radiation — light, x-rays, neutrons, and 
so on — off the system of interest; see Section 10.7.A. In a standard scattering experiment, 
a well-collimated beam of light, or other radiation, with known wavelength X is directed at 
the sample and one measures the intensity, 1(0), of the light scattered at an angle 0 from 




450 Chapter 12 • Phase Transitions: Criticality, Universality, and Scaling 


the “forward” direction of the beam. The radiation undergoes a shift in the wavevector, k, 
which is related to the parameters 0 and X by 



( 1 ) 


Now, the scattered intensity 1(6) is determined by the fluctuations in the medium. If the 
medium were perfectly uniform (i.e., spatially homogenous), there would be no scattering 
at all! If one has in mind light scattering from a fluid, then the relevant fluctuations corre¬ 
spond to regions of different refractive index and, hence, of different particle density n(r). 
For neutron scattering from a magnet, fluctuations in the spin or magnetization density 
are the relevant quantities, and so on. We need to study here the normalized scattering 
intensity 1(9; T,H)/I ldeal (6), where I(6;T,H) is the actual scattering intensity observed 
at angle 6, which will normally depend on such factors as temperature, magnetic held, 
and so on, while / ldeal (0) is the scattering that would take place if the individual parti¬ 
cles (or spins) that cause the scattering could somehow be taken far apart so that they no 
longer interact and hence are quite uncorrelated with one another. Now, this normalized 
scattering intensity turns out to be essentially proportional to the quantity 



( 2 ) 


which represents the Fourier transform of the appropriate real-space correlation function 
g(r), which will be defined shortly. 

As the critical point of the system (say, a fluid) is approached, one observes an enor¬ 
mously enhanced level of scattering, especially at low angles which corresponds, via 
equations (1) and (2), to long wavelength density fluctuations in the fluid. In the criti¬ 
cal region, the scattering is so large that it can be visible to the unaided eye, particularly 
through the phenomenon of critical opalescence. This behavior is, by no means, limited 
to fluids. Thus if, for example, one scatters neutrons from iron in the vicinity of the Curie 
point, one likewise sees a dramatic growth in the low-angle neutron scattering intensity, 
as sketched in Figure 12.13. One sees that for small-angle scattering there is a pronounced 
peak in 1(9; T) as a function of temperature, and this peak approaches closer and closer to 
T c as the angle is decreased. Of course, one could never actually observe zero-angle scat¬ 
tering directly, since that would mean picking up the oncoming beam itself, but one can 
extrapolate to zero angle. When this is done, one finds that the zero-angle scattering 1(0; T) 
actually diverges at T c . This is the most dramatic manifestation of the phenomenon of criti¬ 
cal opalescence and is quite general, in that it is observed whenever appropriate scattering 
experiments can be performed. Empirically, one may write for small-angle scattering 


fmax($) h A| , { T max (9) — T c } ~ k /2 


■max' 


(3) 


so that 


Imax(6){ Tmax(9) ~ Tc}^ 1 ^ 2 — const. 


■max' 


(4) 
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FIGURE 12.13 Schematic plot of the elastic scattering intensity of neutrons scattered at an angle 8 from a magnetic 
system, such as iron, in the vicinity of the critical point T c . The small arrows mark the smoothly rounded maxima (at 
fixed 6) which occur at a temperature T max (8) that approaches T c as 8 ->■ 0. 


Here, W and are positive exponents (which, as will be seen later, are determined by the 
universality class to which the system belongs), while k, for a given 9, is determined by 
equation (1); note that, for small 6, k is essentially proportional to 9. 

The first real insight into the problem of critical scattering in fluids was provided by 
Ornstein and Zernike (1914) and Zernike (1916) who emphasized the difference between 
the direct influence of the microscopic interactions among the atoms of the fluid, which 
are necessarily short-ranged, and the indirect (but more crucial) influence of the density- 
density correlations that become long-ranged as the critical temperature is approached; it 
is the latter that are truly responsible for the propagation of long-range order in the sys¬ 
tem and for practically everything else that goes with it. Unfortunately, the original work of 
Ornstein and Zernike makes difficult reading; moreover, it is based on the classical theory 
of van der Waals. Nevertheless, the subject has been neatly clarified in the review arti¬ 
cles by Fisher (1964, 1983) and Domb (1985), to which the reader may turn for further 
details. Here, we shall stick to the language of the magnetic systems and work out the most 
essential parts of the theory in somewhat general terms. 

We define the spin-spin correlation function g(i,j), for the pair of spins at sites i and j, 
by the standard definition 


g(i,j) = OiOj - a iff j. (5) 

For i — j, expression (5) denotes the “mean-square fluctuation in the value of the variable 
ct at site i”\ on the other hand, as the separation between the sites i and j increases indefi¬ 
nitely, the spins n, and oj get uncorrelated, so that oyf n/oy and the function g(i,j) -> 0. 
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In view of the fact that expression (5) can also be written as 

g(i,j) = (<7j - Oi)((Tj - of), (6) 

the function g(i,j) may also be looked upon as a measure of the “correlation among 
the fluctuations in the order parameter of the system at sites i and j.” This makes sense 
because 07 may, quite appropriately, be regarded as the locally fluctuating order parameter 
linked to site i, just as a is the order parameter for the whole system. We shall now establish 
connections between the function g(i,j) and some important thermodynamic properties 
of the system. 

We start with the partition function of the system, see equation (12.3.9), 


Qn(H,T) = exp 
Wi) 


pj a i a i+Pf- H Y1 a ’ 

n.n. i 


where the various symbols have their usual meanings. It follows that 


— (\nQ N ) = = PM, 

where M(= p denotes the net magnetization of the system. Next, since 

9 2 , n 9/ 1 9 Qn \ 1 9 2 Qn _1 / 9Qv \ 2 

dH 2 ^ N > ~ d H\Q N dH )~ Q n dH 2 Q 2 N \dH ) 

= ^ 2 (M 2 -M 2 ), 


we obtain for the magnetic susceptibility of the system 


x 


9M 

a ~h 


= P(M 2 -M 2 ) 


(7) 


( 8 ) 


(9) 


( 10 a) 


“(z! 0 *') J = Pi i2 J2J2s (i ’jp ( iob ) 

Equation (10a) is generally referred to as the fluctuation-susceptibility relation; it may 
be compared with the corresponding relation for fluids, namely (4.5.7), which connects 
isothermal compressibility kt with the density fluctuations in the system. Equations (10) 
and (4.5.7) represent the equilibrium limit of the fluctuation-dissipation theorem dis¬ 
cussed later in Section 15.6. Equation (10b), on the other hand, relates x to a summation 
of the correlation function g(i,j) over all i and j; assuming homogeneity, this may be 
written as 


X=NPu 2 J2g( r) ( r — r j — rp. 


(11) 
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Treating r as a continuous variable, equation (11) may be written as 

X = J S(r) dr ’ (12) 

where a is a microscopic length, such as the lattice constant, so defined that Na d = V, the 
volume of the system; for a similar result appropriate to fluids, see equation (10.7.14). 
Finally, introducing the Fourier transform of the function g(r), through equation (2), we 
observe that 


x = 



1(0); 


(13) 


compare this result to equation (10.7.21) for fluids. 

Our next task consists in determining the mathematical form of the functions g(r ) and 
g(Jfc). Pending exact calculations, let us see what the mean field theory has to offer in this 
regard. Following Kadanoff (1976b), we consider a magnetic system subject to an exter¬ 
nal field H which is nonuniform, that is, H = {Hj}, where Hi denotes the field at site i. 
Using mean field approximation, the thermal average of the variable er* is given by, see 
equation (12.5.10), 


o'; = tanh(^/xJ/ eff ), 


(14) 


where 


H dt = H i + U/ri'£°fi (15) 

n.n. 

note that, in view of the nonuniformity of H, the product (cfo) of equation (12.5.10) has 
been replaced by a sum of Tfj over all the nearest neighbors of spin i. If a varies slowly 
in space, which means that the applied field is not too nonuniform, then (15) may be 
approximated as 


H eS ~ Hi + (qj/ u)oi + (c/a 2 //r)V 2 cr ! -, (16) 

where c is a number of order unity whose actual value depends on the structure of the 
lattice, while a is an effective lattice constant; note that the term involving Vo 7 ,- cancels 
on summation over the q nearest neighbors that are supposed to be positioned in some 
symmetrical fashion around the site i. At the same time, the function tanhx, for small x, 
maybe approximated byx — x 3 /3. Retaining only essential terms, we get from (14) and (16) 

PuHi = (1 - qfiJWi + i {qpjfa] - cpja 2 V 2 Oi. 


(17) 
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Now, the conditions for criticality are {Hj} = 0 and qfij = 1; see equation (12.5.13). So, near 
criticality, we may introduce our familiar variables 

hi = PnH t , t = (T — T c )/T c ~ (p c - pypd (18) 


equation (17) then reduces to 

(t + - c'a 2 V 2 ^j at = hi, 


(19) 


where d is another number of order unity. Equation (19) generalizes equation (12.10.1) of 
Landau’s theory by taking into account the nonuniformity of a. 

Differentiating (19) with respect to hj, we get 

(t + - da 2 V 2 ) ^ = Sij. (20) 

The “response function” dWj/dhj is identical with the correlation function g(i,j); 17 equa¬ 
tion (20) may, therefore, be written as 

(t + o’? - c'« 2 V 2 ) g(i,j) = Sij. (21) 

For t > 0 and {h,} -»• 0, ?f ; - 0; equation (21) then becomes 

(t-c'a 2 V 2 )g(i,j) = S i j. (22) 

Assuming homogeneity, so that g(i,j) = g(r) where r — rj - r lt and introducing Fourier 
transforms, equation (22) gives the form 

(t + c'cdk 2 ^ g(k ) = const. (23) 

It follows that g(k) is a function of the magnitude k only (which is not surprising in view of 
the assumed symmetry of the lattice). Thus 


17 Remembering that M = /iSicr;, we change the field jff,) to {Hi + SHp, with the result that 


8M = hJ2 


^C(9(T i/dHj)8Hj 


Now, for simplicity, we let all 8Hj be the same; then 

(SM/8H) = 

* J 


Comparing this with (10b), we infer that ( dcri/dHj ) = P/ig(i,j ) and hence ( dcri/dhj ) = g(i,j). 
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which is the famous Ornstein-Zernike result derived originally for fluids. 

Now, taking the inverse Fourier transform of g(k), we obtain (disregarding numerical 
factors that are not so essential for the present argument] 

g(r)~ f 6 d d (ka) (25a) 

J t+c'a z k z 

7 a d / l \ < rf - 2) /2 

(25b) 

0 

see equations (8) and (11) of Appendix C. The integral in (25b) is tabulated; see Gradshteyn 
and Ryzhik (1965, p. 686). We get 

/ 2 \ 2)/2 . . 

g(r) ~ (^ J K(d- 2)/2 (J) 1? = a(c'/t) 1/2 }, (26) 

(x) being a modified Bessel function. For x» 1, K /t (x) ~x _1 / 2 e _x ; equation (26) then 
gives 


g ( '')^ g( rf -3V2^-l)/2 e ~ r/l (r>>?) - (2?) 

On the other hand, for x < 5 C 1, (x), for fi > 0, ~ x _/i ; equation (26) then gives 

fid—2 

g(r)~ 7 d =2 (»■«$;«* >2). (28) 

In the special case d = 2, we obtain instead 

g(r)~ln($/r) (r«f;d = 2). (29) 

It is worth noting that equation (26) simplifies considerably when d = 3. Since K\ / 2 (x) is 
exactly equal to (n/2x) l ! 2 e~ x for a// x, 

g(r)ld= 3 ~ (30) 

for all r. Equation (30) is another important result of Ornstein and Zernike. 

Clearly, the quantity £ appearing here is a measure of the “distances over which the 
spin-spin (or density-density) correlations in the system extend” — hence the name 
correlation length. So long as T is significantly above T c , f = 0(a); see (26). However, as 
T approaches T c , § increases indefinitely — ultimately diverging at T — T c . The resulting 
singularity is also of the power-law type: 

H-ar 1 ! 2 (f> 0). 


(31) 
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The divergence of £ at T — T c is perhaps the most important clue we have for our 
general understanding of the critical phenomena. As £ -»• oo, correlations extend over the 
entire system, paving the way for the propagation of long-range order (even though the 
microscopic interactions, which are at the root of the phenomenon, are themselves short- 
ranged) . Moreover, since correlations extend over macroscopic distances in the system, any 
structural details that differentiate one system from another at the microscopic level lose 
significance, leading thereby to universal behavior! 

Going back to equations (13) and (24), we see that the singularity in / is indeed of the 
type expected in a mean field theory, namely 

X ~g(0) ~ r 1 (f > 0). (32) 

In view of the foregoing results, one may write 

_L~I(l + f 2 fc 2 ) (33) 

g(k) X 

In a so-called Ornstein-Zernike analysis, one plots 1 /g(k) (or 1/I(k), where I(k) is the 
intensity of the light scattered at angle 6, see equation (1)) in the critical region versus k 2 . 
The data for small k (ka < 0.1), for which the above treatment holds, fall close to a straight 
line whose intercept with the vertical axis determines x (f) • As t -*■ 0, this intercept tends to 
zero but the successive isotherms remain more or less parallel to one another; the reduced 
slope evidently serves to determine §(f). For t ~ 0, these plots show a slight downward cur¬ 
vature, indicating departure from the A; 2 -law to one in which the power of k is somewhat 
less than 2. Finally, as regards the plot 1(6; T) of Figure 12.13 earlier in this section, the max¬ 
imum in the curve, according to equation (24), should lie at t — 0 for all 0 and the height 
of the maximum should be ~ k~ 2 (i.e., essentially ~ 8 ~ 2 ); thus, according to the mean held 
expression for g(k), the exponent >.\ in equation (3) should be 2 while a 2 should be 0. 


12.12 The critical exponents v and 1 / 

According to the mean held theory, the divergence of £ at T = T c is governed by the 
power law (12.11.31), with a critical exponent We anticipate that the experimental data 
on actual systems may not conform to this law. We, therefore, introduce a new critical 
exponent, v, such that 


(h^0,t>0). U) 

In the spirit of the scaling hypothesis, see Section 12.10, the corresponding exponent 
v' appropriate to t<0 would be the same as v. 18 Table 12.1 in Section 12.7 shows 

18 It turns out that the exponent v' is relevant only for scalar models, for which n = 1; for vector models (n > 2), f is 
infinite at all T <T C and hence v' is irrelevant. 
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experimental results for v obtained from a variety of systems; we see that the observed 
values of v, while varying very little from system to system, differ considerably from the 
mean field value. 

As regards the correlation function, the situation for t > 0 is described very well by a law 
ofthe type (12.11.27), namely 

g{r) ~ e~ r ^ (t) x some power of r (f > 0), (2) 

where f (t) is given by (1). The variation of g(r) with r in this regime is governed primarily 
by the exponential, so g(r ) falls rapidly as r exceeds § (which is typically of the order of the 
lattice constant a). As t -»• 0 and hence f oo, the behavior of g(r) would be expected to be 
like equation (12.11.28) or (12.11.29). A problem now arises: we have an exact expression 
for g(r) at T — T c for a two-dimensional Ising model (see Section 13.4), according to which 

g(r) ~ r -1 / 4 (d = 2,n = 1, t = 0), (3) 

which is quite different from the mean field expression (12.11.29). We, therefore, generalize 
our classical result to 


g{r) ~ r~ (d ~ 2+,1) (t = 0), 


(4) 


which introduces another critical exponent, Clearly, ij for the two-dimensional Ising 
model is which can even be confirmed by experiments on certain systems that are 
effectively two-dimensional. Table 12.1 shows experimental values of rj for some systems 
in three dimensions; typically, rj turns out to be a small number, which makes it rather 
difficult to measure reliably. 

We shall now derive some scaling relations involving the exponents v and rj. First of 
all we write down the correlation function g(r; t, h) and its Fourier transform g(k; t, h) in a 
scaled form. For this, we note that, while h scales with f A , the only natural variable with 
which r will scale is £; accordingly, r will scale with t~ v . We may, therefore, write 


g(r; t, h ) 


G(rt",h/t A ) 

yd, — 2+t] 


g(k ; t, h ) 


G(k/t v ,h/t A ) 
k^i 


(5a, b) 


where the functions G(x,y) and G(z,y), like the exponents A, v, and rj, are universal for a 
given universality class; in expressions (5), for simplicity, we have suppressed nonuniversal 
parameters that vary from system to system within a class. In the absence of the field (h — 
0), expressions (5) reduce to 


g(r, t, 0) 


Go(rd') 

yd —2 -\-Tj 


g(k ; t, 0) 


GpQc/n 

k 2 -') 


(6a, b) 
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where Go(x) and Go(z) are again universal. At the critical point (h = 0,t = 0), we have 
simply 

Sc(r)-^^, ( 7a - b > 

We now recall equation (12.11.12), which relates x to an integral of g(r) over dr, and 
substitute expression (6a) into it. We get, ignoring nonuniversal parameters as well as 
numerical factors, 


x 


/ 


G o (rt '">d-\ 

yd — 2+q 


dr. 


( 8 ) 


By a change of variables, this gives 


X ~ (9) 

Invoking the standard behavior of /, we obtain 


y = (2—»?)y. (10) 

Note that the same scaling relation can also be obtained by appealing to equa¬ 
tions (12.11.13) and (6b); the argument is that, in the limit k -»• 0, the function Go(z) must 
be ~ z 2 ~ 71 (so that k is eliminated), leaving behind a result identical to (9). In passing, we 
note that in the critical region 


x~$ 2 ~ n - (ID 

Relation (10) is consistent with the Fisher inequality (1969) 

y < (2 - jj)v, 


and is obviously satisfied by the mean field exponents (y = 1, v = r] = 0); it also checks 
well with the experimental data given earlier in Table 12.1. In fact, this relation provides 
a much better method of extracting the elusive exponent i], from a knowledge of y and v, 
than determining it directly from experiment. Incidentally, the presence of rj explains the 
slight downward curvature of the Ornstein-Zernike plot, 1 /g(k) versus lc 2 , as k -> 0, for the 
appropriate expression for 1 /g(k) now is 

^-~-(l +? 2 -"fc 2 -"), (12) 

g(k) x 

rather than (12.11.33). 

We shall now derive another exponent relation involving v, but first notice that all expo¬ 
nent relations derived so far have no explicit dependence on the dimensionality d of the 
system (though the actual values of the exponents do depend on d). There is, however, 
one important relationship that does involve d explicitly. For this, let us visualize what 
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happens inside the system (say, a magnetic one) as t 0 from above. At some stage the 
correlation length £ becomes significantly larger than the atomic spacings, with the result 
that magnetic domains, of alternating magnetization, begin to appear. The closer we get 
to the critical point, the larger the size of these domains; one may, in fact, say that the vol¬ 
ume of any such domain is ~ f d . Now, the singular part of the free energy density of the 
system — or, for that matter, of any one of these domains — is given by, see equa¬ 
tion (12.10.7) with h = 0, 


/ (s) (t)~f 2 -“, (13) 

which vanishes as t -*■ 0. At the same time, the domain volume C diverges. It seems natural 
to expect that/ (s) , being a density, would vanish as 1/Q, that is, 

/ (s) (t)~fi- 1 ~t- rf ~f dy . (14) 

Comparing (13) and (14), we obtain the desired relationship 

dv = 2 — a, (15) 

which is generally referred to as a hyperscaling relation — to emphasize the fact that it goes 
beyond, and cannot be derived from, the ordinary scaling formulation of Section 12.10 
without invoking something else, such as the domain volume £2. 

The relation in equation (15) is consistent with the Josephson inequalities (1967): 

dv>2 — a, dv'>2 — a', (16a, b) 

proved rigorously by Sokal (1981); of course, the scaling theory does not distinguish 
between exponents pertaining to t > 0 and their counterparts pertaining to t < 0. It is 
important to note that the classical exponents (v = \,ol = 0) satisfy (15) only for d = 4, 
which shows that the hyperscaling relations, (15) and any others that follow from it, have a 
rather different status than the other scaling relations (that do not involve d explicitly). The 
renormalization group theory, to be discussed in Chapter 14, shows why the hyperscaling 
relations are to be expected fairly generally, why typically they hold for d < 4 but break 
down for d > 4; see also Section 12.13. The reader may check that relation (15) is satisfied 
reasonably well by the experimental data of Table 12.1, with d = 3; it is also satisfied by the 
exponents derived theoretically by solving different models exactly, or almost exactly, as in 
Chapter 13. 

Combining (15) with other scaling relations, see Section 12.10, we may write 

dv = 2 - a = 2/3 + y = fi(8 + 1) = y (8 + 1)/(S - 1). (17) 


It follows that 


2 -n = y/v = d(8- l)/(8 + 1) 


(18) 
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which is consistent with the Buckingham-Gunton inequality (1969) 

2 — ri< d{8 —1)/(<5 +1). (19) 

Notice that the experimental observation that, for magnetic systems in three dimensions, 
S < 4.8, implies that rj > 0.034. 

12.13 A final look at the mean field theory 

We now return to the question: why does the mean field theory fail to represent the true 
nature of the critical phenomena in real systems? The short answer is — because it neglects 
fluctuations! As emphasized in Section 12.11, correlations among the microscopic con¬ 
stituents of a given system are at the very heart of the phenomenon of phase transitions, 
for it is through them that the system acquires a long-range order (even when the micro¬ 
scopic interactions are themselves short-ranged). At the same time, there is so direct a 
relationship between correlations and fluctuations, see equation ( 12 . 11 . 6 ), that they grow 
together and, as the critical point is approached, become a dominant feature of the system. 
Neglecting fluctuations is, therefore, a serious drawback of the mean field theory. 

The question now arises: is mean field theory ever valid? In other words, can fluctua¬ 
tions ever be neglected? To answer this question, we recall the fluctuation-susceptibility 
relation ( 12 . 11 . 10 a), namely 

X = (AG-M 2 )/kT= (AM) 2 /kT (1) 


and write it in the form 


(A M) 2 /M 2 = kT X /M 2 . (2) 

Now, in order that the neglect of fluctuations be justified, we must have: 

kTx « M 2 . (3) 

Requirement (3) is generally referred to as the Ginzburg criterion (1960); for a more 
detailed discussion of this criterion, along with physical illustrations, see Als-Nielsen and 
Birgeneau (1977). 

We apply condition (3) to a domain, of volume ~ £ d , close to but below the critical 
point; we are assuming here a system of the Ising type (n = 1), so that £ is finite for t < 0. 
Invoking the power-law behavior of / and M, we have 

kT c (A^ d \t\~ Y ) « (B% d \t\P) 2 (t< 0), (4) 

where A and B are positive constants. Since £ ~ a\t\~ v , we get 

\ t \dv-2p- y ^B 2 a d /AkT c . 


(5) 
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In view of the scaling relation a + 2/8 + y — 2, we may as well write 

w dv-(2-a) <D ( 6 ) 

where D is a positive number of order unity. 19 For the mean field theory (with v — \,a = 0) 
to be valid, condition (6) assumes the form 

|t| (d —4)/2<D. (7) 

Now, since \t\ can be made as small as we like, condition (7) will be violated unless d> 4. 
We, therefore, conclude that the mean field theory is valid for d > 4; by implication, it is 
inadequate for d < 4. 

The preceding result has been established for scalar models (n = 1) only, fn 
Section 13.5, we shall see that in the case of the spherical model, which pertains to the limit 
n -*■ oo, the mean field results do apply when d > 4. This means that, once again, fluctua¬ 
tions can be neglected if d > 4. Now fluctuations are supposed to decrease with decreasing 
n; the validity of the mean field theory for d > 4 should, therefore, hold for all n. 

Ordinarily, when a system is undergoing a phase transition, expression (2), which is 
a measure of the relative fluctuations in the system, is expected to be of order unity. 
Condition (6) then suggests that the exponents v and a obey the hyperscaling relation 

dv = 2 — a. (8) 

Experience shows that this relation is indeed obeyed when d < 4. At d — 4, the mean field 
theory begins to take over and thereafter, for all d > 4, the critical exponents are stuck at 
the mean field values (which are independent of both d and n). The dimensionality d = 4 
is often referred to as the upper critical dimension for the kind of systems under study. 

An alternative way of looking into the question posed at the beginning of this section 
is to examine the specific heat of the system which, according to the mean field theory, 
undergoes a jump discontinuity at the critical point whereas in real systems it shows a 
weak divergence. The question now arises: what is the source of this divergence that is 
missed by the mean field theory? The answer again lies in the “neglect of fluctuations." To 
see it more explicitly, we look at the internal energy of the system which, in the absence of 
the field, is given by 



(9) 


In the mean field theory, one replaces may by a ; ay(= er 2 ), see equation (12.5.5), which leads 
to the jump discontinuity in the specific heat of magnitude j/Nk] see equation (12.5.18). 

19 To see this, we note that A ~ N/i 2 / £2kT c while B~ N^/Q, with the result that D~ Na d /Cl = 0(1). 
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The fluctuational part of U, which is neglected in the mean field theory, may be written as 

Uf = - at*]) = ( 10 ) 

n.n. n.n. 


where g(r) is the spin-spin correlation function, for which we may use the mean field 
expression (12.11.26); thus, we will be using the mean field theory itself to predict its own 
shortcomings! Since the nearest-neighbor distances r L j in (10) are all much smaller than 
£, one may be tempted to use for g(r L j) the zeroth-order approximation (12.11.28). This, 
however, produces a temperature-independent term, which does not contribute to the 
specific heat of the system. We must, therefore, go to the next approximation, which can 
be obtained by using the asymptotic formulae 


W\x«l 


i (i r" i fi Y 

gYY + 2 r( Y2 x ) 


2 —M 


for 

0 < /x < 1 

(11a) 

for 

/x = 1 

(lib) 

for 

/X > 1, 

(11c) 


with /x = (d — 2)/2 and x — ry /£. The temperature-dependent part of Uf comes from the 
second term(s) in (11); remembering that £ here is ~ at -1 / 2 , we get 20 


(Uf / NJ ) thermal 


t (d ~ 2)/2 for 2 < d < 4 

fln(l/t) for d = 4 
t for d > 4. 


The fluctuational part of the specific heat then turns out to be 


(12a) 

(12b) 

(12c) 


Cf/Nk 


f (d-4)/2 f or 2 < d < 4 
ln(l/t) for d = 4 

const. for d > 4. 


(13a) 

(13b) 

(13c) 


It follows that the specific-heat singularity for d > 4 is indeed a “jump discontinuity,” 
and hence the mean field theory remains applicable in this case. For d = 4, Cf shows a 
logarithmic divergence and for d < 4 a power-law divergence, making mean field theory 
invalid for d < 4. 

It is rather instructive to see what part of the fluctuation-correlation spectrum, g(k), 
contributes significantly to the divergence of the specific heat at t = 0. For this, we examine 

20 Note that the negative sign in (10) cancels the implicit negative sign of F(-/O in (11a), that of ln(jX) in (lib), and 
the explicit negative sign in (11c). 
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the quantity -dg(rij)/dt, which essentially determines the behavior of Cf near the critical 
point; see equation (10). Using equation (12.11.25a), we have 


3 g(''y) 
dt 



e ~ ikr ij 

t 2 a+^ 2 k 2 )2 


k d ~ l dk. 


(14) 


Now, the values of k that are much larger than £ _1 contribute little to this integral; the only 
significant contributions come from the range (0, Lav ), where /c max = 0(£ -1 ); moreover, 
since r,y <$; £, the exponential for these values of k is essentially equal to 1. Expression (14) 
may, therefore, be written as 


3g(r,y) d I" k d - ] dk 

J t 2 a + s 2 k 2 ) 2 


(15) 


which, for d < 4, scales as (a/f)“f -2 ~ compare with (13a). We thus see that the 

most significant contribution to the criticality of the problem arises from fluctuations 
whose length scale, A: -1 , is of order £ or longer and hardly any contribution comes from 
fluctuations whose length scale is shorter. Now, it is only the latter that are likely to pick 
up the structural details of the system at the atomic level; since they do not play any sig¬ 
nificant role in bringing about the phenomenon, the precise nature of criticality remains 
independent of the structural details. This explains why a large variety of systems, differ¬ 
ing so much in their structure at the macroscopic level, may, insofar as critical behavior is 
concerned, fall into a single universality class. 


Problems 


12.1. Assume that in the virial expansion 


Pv 

kf 




(10.4.22) 


where ftj are the irreducible cluster integrals of the system, only terms withy = 1 andy = 2 are 
appreciable in the critical region. Determine the relationship between /}i and ft-/ at the critical 
point, and show that kT c /P c v c = 3. 

12.2. Assuming the Dietrici equation of state, 

P(v— b ) = kT exp (—a/kTv), 


evaluate the critical constants P c , v c , and T c of the given system in terms of the parameters a and 

b, and show that the quantity kT c /P c v c = e 2 /2 ~ 3.695. 

Further show that the following statements hold in regard to the Dietrici equation of state: 

(a) It yields the same expression for the second virial coefficient B 2 as the van der Waals equation 
does. 

(b) For all values of P and for T > T c , it yields a unique value of v. 

(c) For T <T C , there are three possible values of v for certain values of P and the critical volume 
v c is always intermediate between the largest and the smallest of the three volumes. 

(d) The Dietrici equation of state yields the same critical exponents as the van der Waals 
equation does. 
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12.3. Consider a nonideal gas obeying a modified van der Waals equation of state 

(P + a/v n )(v — b) = RT («>1). 

Examine how the critical constants P c , v c , and T c , and the critical exponents p, y, y', and S, of this 
system depend on the number n. 

12.4. Following expressions (12.5.2), define 

p=(l + L)/2 and q=(l-L)/2 (-1<L<1) (1) 

as the probabilities that a spin chosen at random in a lattice composed of N spins is either “up” or 
“down.” The partition function of the system may then be written as 

Q{B, T) = Y^g{De pN ^ q,L2+llBL) , (2) 

L 

where g(L) is the multiplicity factor associated with a particular value of L, that is, 

g(L) = N\/(Np)\(Nq)l; (3) 

note that in writing the Hamiltonian here we have made the assumption of random mixing, 
according to which 

(N++ + AL_ - N + -) = l -qN(p 2 + q 2 - 2pq) = \qNL 2 . 

(a) Determine the value, L*,ofL that maximizes the summand in (2). Check that L* is identical to 
the mean value, L, as given by equation (12.5.10). 

(b) Write down the free energy A and the internal energy U of the system, and show that the 
entropy S conforms to the relation 

S(B, T) = -Nk(p* In p* + q* In q*), 

where p* = p(L*) and q* = q(L*). 

12.5. Using the correspondence established in Section 12.4, apply the results of the preceding problem 
to the case of a lattice gas. Show, in particular, that the pressure, P, and the volume per particle, v, 
are given by 

P = pB — ^<7£ 0 (l+h 2 ) - ^ArTTn 

and 

!/-!= *(1±Z). 

Check that the critical constants ofthis system are: T c = qeo/4k, P c = kT c (ln2 — |),andty = 2, so 
that the quantity kT c /P c v c = l/(ln4 — 1) ~ 2.589. 

12.6. Consider an Ising model with an infinite-range interaction such that each spin interacts equally 
strongly with all other spins: 

H = ~ 

i<j i 

Express this Hamiltonian in terms of the parameter L(= AU 1 E/cr, ) and show that, in the limit 
N -> co and c-> 0, the mean field theory, with / = Nc/q, is exact for this model. 

12.7. Study the Heisenberg model of a ferromagnet, based on the interaction (12.3.6), in the mean field 
approximation and show that this also leads to a phase transition of the kind met with in the Ising 
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model. Show, in particular, that the transition temperature T c and the Curie-Weiss constant C are 
given by 

ql 2s(s+ 1) N(gp B ) 2 s(s + 1) 

Tc= T^- and ° = ^k -3“- 

Note that the ratio T c /CV = 2qJ/N(gp B ) 2 is the molecular field constant of the problem; compare 
to equation (12.5.8). 

12.8. Study the spontaneous magnetization of the Heisenberg model in the mean field approximation 
and examine the dependence of Lq on T (i) in the neighborhood of the critical temperature where 
(1 — T/T c ) <sc 1, and (ii) at sufficiently low temperatures where T/T c <g 1. Compare these results 
with the corresponding ones, namely (12.5.14) and (12.5.15) for the Ising model. 

[In this connection, it may be pointed out that, at very low temperatures, the experimental 
data do not agree with the theoretical formula derived here. We find instead a much better 
agreement with the formula L 0 = {1 -A(kT/J) 3 l 2 }, where A is a numerical constant (equal to 
0.1174 in the case of a simple cubic lattice). This formula is known as Bloch's 7’ 3, ' 2 -law and is 
derivable from the spin-wave theory of ferromagnetism; see Wannier (1966), Section 15.5.] 

12.9. An antiferromagnet is characterized by the fact that the exchange integral / is negative, which 
tends to align neighboring spins antiparallel to one another. Assume that a given lattice structure 
is such that the whole lattice can be divided into two interpenetrating sublattices, a and b say, so 
that the spins belonging to each sublattice, a as well as b, tend to align themselves in the same 
direction, while the directions of alignment in the two sublattices are opposite to one another. 
Using the Ising as well as Heisenberg type of interaction, and working in the mean field 
approximation, evaluate the paramagnetic susceptibility of such a lattice at high temperatures. 

12.10. The N&el temperature Tv of an antiferromagnet is defined as that temperature below which the 
sublattices a and b possess nonzero spontaneous magnetizations M a and Mj,, respectively. 
Determine Tjv for the model described in the preceding problem. 

12.11. Suppose that each atom of a crystal lattice can be in one of several internal states (which maybe 
denoted by the symbol o) and the interaction energy between an atom in state o' and its nearest 
neighbor in state a" is denoted by u(o',o"){= u(o",o')}. Let/(cr) be the probability of an atom 
being in a particular state o, independently of the states in which its nearest neighbors are. The 
interaction energy and the entropy of the lattice may then be written as 

E=^qNj2 u(o',o")f(o')f(o") 
o' ,a" 

and 

S/Nk = -'£/(?) ln/(«r), 


respectively. Minimizing the free energy ( E — TS ), show that the equilibrium value of the function 
f{o) is determined by the equation 

f(o) = Cexp{—(q/kT)'E a 'U(o,o')f(o')}, 


where C is the constant of normalization. Further show that, for the special case 
u(o',o") = —Jo'o", where the a can be either+1 or —1, this equation reduces to the Weiss 
equation (12.5.11), with/(a) = i(1 +L 0 o). 

12.12. Consider a binary alloy containing N A atoms of type A and N B atoms of type B, so that the relative 
concentrations of the two components are: x A = N A /(N A + N b ) < 4 and*# = N B /(N A + N B ) > f. 
The degree of long-range order, X, is such that 


A 

a 

B 

a 


] -Nx a (\+X), 

^N(x b -x a X), 


A 

b 


] -Nx a (1~X), 


B 

b 


1 

2 


N(x b + x a X), 
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where N = Na+Nb, while the symbol 


denotes the number of atoms of type A occupying 


sites of the sublattice a, and so on. In the Bragg-Williams approximation, the number of 
nearest-neighbor pairs of different kinds can be written down straightaway; for instance, 


AA 

ab 


1 

2 


qN ■ xa( 1+X) • xa( 1 - X), 


and so on. The configurational energy of the lattice then follows from equation (12.4.9). In the 
same approximation, the entropy of the lattice is given by S = k\n W, where 


W = 


dm dm 


Minimizing the free energy of the lattice, show that the equilibrium value ofX is determined by 
the equation 


X (2 qxAS \ 1 

^T^ = tanh (l^ X ) : e = 2 (eil + e22) - £l2>0 - 

Note that, in the special case of equal concentrations (xa = Xb = |), this equation assumes the 
more familiar form 

X=tanh (^&4 

Further show that the transition temperature of the system is given by 

T c = 4x A (l -x A )T°, 


where T°(= qs/2k) is the transition temperature in the case of equal concentrations. 

[Note-. In the Kirkwood approximation (see Kubo (1965), problem 5.19), 7’° turns out to be 
(e/fc){l — y[l — ( 4 /( 7 )] j" 1 , which may be written as (qs/2k)(l -l/q-\ —). To this order, the Bethe 
approximation also yields the same result.] 

12.13. Consider a two-component solution of Na atoms of type A and Nb atoms of type B, which are 
supposed to be randomly distributed over N(= Na -I- Nb) sites of a single lattice. Denoting the 
energies of the nearest-neighbor pairs AA,BB, and AB by £ 11 , 622 , and £ 12 , respectively, write down 
the free energy of the system in the Bragg-Williams approximation and evaluate the chemical 
potentials pa and pb of the two components. Next, show that if £ = (eh + £22 — 2 £i 2 ) < 0, that is, 
if the atoms of the same species display greater affinity to be neighborly, then for temperatures 
below a critical temperature T c , which is given by the expression q\e\/2k, the solution separates 
out into two phases of unequal relative concentrations. 

[Note-. For a study of phase separation in an isotopic mixture of hard-sphere bosons and 
fermions, and for the relevance of this study to the actual behavior of He 3 —He 4 solutions, see 
Cohen and van Leeuwen (1960, 1961).] 

12.14. Modify the Bragg-Williams approximation (12.5.29) to include a short-range order parameter s, 
such that 


lV ++ = 




(1 + 5), N— = -qNy 


1 -L 
2 


2 

U + 5), 


N, 


= 2 - 





U-5). 


(a) Evaluate y from the condition that the total number of nearest-neighbor pairs is \qN. 

(b) Show that the critical temperature T c of this model is (1 — s 2 )qj/k. 
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(c) Determine the nature of the specific-heat singularity at T = T C} and compare your result with 
both the Bragg-Williams approximation of Section 12.5 and the Bethe approximation of 
Section 12.6. 

12.15. Show that in the Bethe approximation the entropy of the Ising lattice at T = T c is given by the 
expression 


^ =ln2+ f ln 



gig-2) 
Mg — i) 


in 



Compare this result with the one following from the Bragg-Williams approximation, namely 
(12.5.20). 

12.16. Examine the critical behavior of the low-field susceptibility, xo, of an king model in the Bethe 
approximation of Section 12.6, and compare your results with equations (12.5.22) of the 
Bragg-Williams approximation. 

12.17. A function/(jt) is said to be concave over an interval (a, b) if it satisfies the property 


/{Axi + (1 - A)x 2 } > A/(xi) + (1 - A,)/(x 2 ), 


where xi and x 2 are two arbitrary points in the interval ( a , b) while A is a positive number in the 
interval (0,1). This means that the chord joining the points X\ andx 2 lies below the curve/(x). 
Show that this also means that the tangent to the curve/(x) at any point x in the interval (a, b) 
lies above the curve/(x) or, equivalently, that the second derivative 9 2 //3x 2 throughout this 
interval < 0. 

12.18. In view of the thermodynamic relationship 

C v = TV(d 2 P/dT 2 ) v - TN(d 2 p/dT 2 ) v 


for a fluid, p being the chemical potential of the system, Yang and Yang (1964) pointed out that, if 
Cy is singular at T = T c , then either (d 2 P/dT 2 )v or ( d 2 p/3T 2 )v or both will be singular. Define an 
exponent 0 by writing 

(d 2 P/dT 2 ) v ~ {T c -T)- @ (T<T C ), 

and show that (Griffiths, 1965b) 

0<ct' + p and 0 < (2 + ot'8)/(8 + 1). 

12.19. Determine the numerical values of the coefficients ri and s n of equation (12.9.5) in (i) the 
Bragg-Williams approximation of Section 12.5 and (ii) the Bethe approximation of Section 12.6. 
Using these values ofri and s 0 , verify that equations (12.9.4), (12.9.9), (12.9.10), (12.9.11), and 
(12.9.15) reproduce correctly the results obtained in the zeroth and the first approximation, 
respectively. 

12.20. Consider a system with a modified expression for the Landau free energy, namely 

xphlt, m) = —hm + q(t) + r(t)m 2 +s(t)m 4 + u(t)m 6 , 

with u(t) a fixed positive constant. Minimize x/r with respect to the variable m and examine the 
spontaneous magnetization mo as a function of the parameters r and .s'. In particular, show the 
following: 21 

(a) For r > 0 and s > — (3 m? - ) 1 / 2 , mo = 0 is the only real solution. 

(b) For r > 0 and —(4 ur) 1 ? 2 < s < — (Pur) 1 ! 2 , mo = 0 or ±mi, where m 2 = 2l ‘ l ‘ n s - However, the 

minimum of f at m 0 = 0 is lower than the minima at m 0 = ±mi, so the ultimate equilibrium 
value of mo is 0. 

(c) For r > 0 and s = -(4ur) 1 ^ 2 , m 0 = 0 or ±( r/u) 1 ^ 4 . Now, the minimum of xjr at m 0 = 0 is of the 
same height as the ones at mo = ±(r/u) l t i , so a nonzero spontaneous magnetization is as 
likely to occur as the zero one. 


21 To fix ideas, it is helpful to use (r,s)-plane as our ‘‘parameter space.” 
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(d) For r > 0 and 5 < —(4 ur) 1 ? 2 , mo = ±ni\ — which implies a first-order phase transition 
(because the two possible states available here differ by a finite amount in m). The line 
5 = —(4wr) 1 / 2 , with r positive, is generally referred to as a “line of first-order phase 
transitions." 

(e) For r = 0 and s < 0, mo = ±(2\s\fiu) 1 / 2 . 

(f) For r < 0, mo = ±m\ for all 5. As r -»■ 0, m\ 0 if 5 is positive. 

(g) For r = 0 and 5 > 0, mo = 0 is only solution. Combining this result with (f), we conclude that 
the line r = 0, with .s' positive, is a “line of second-order phase transitions,” for the two states 
available here differ by a vanishing amount in m. 

The lines of first-order phase transitions and second-order phase transitions meet at the point 
(r = 0, s = 0), which is commonly referred to as a tricritical point (Griffiths, 1970). 

12.21. In the preceding problem, put 5 = 0 and approach the tricritical point along the r-axis, setting 
r^r\t. Show that the critical exponents pertaining to the tricritical point in this model are 

a = - ,6 = -,y = 1, and 5 = 5. 

2 4 

12.22. Consider a fluid near its critical point, with isotherms as sketched in Figure 12.3. Assume that the 
singular part of the Gibbs free energy of the fluid is of the form 

G (s) (T,P)~|f| 2 -“g(7r/|f| A ), 

where n = (P - P c )/P c , t= (T — T c )/T c while g(x) is a universal function, with branches g+ for 
t > 0 and g for t < 0; in the latter case, the function g has a point of infinite curvature at a value 
of tv that varies smoothly with f, such that jt(0) = 0 and (dn/dt) t ^o = const. 

(a) Using the above expression for G (s) , determine the manner in which the densities, pi and p gt 
of the two phases approach one another as t -y 0 from below. 

(b) Also determine how (P - P c ) varies with (p — p c ) as the critical point is approached along the 
critical isotherm (t= 0). 

(c) Examine as well the critical behavior of the isothermal compressibility kj, the adiabatic 
compressibility /eg, the specific heats Cp and CV, the coefficient of volume expansion up, and 
the latent heat of vaporization l. 

12.23. Consider a model equation of state which, near the critical point, can be written as 

h^am(t+bm 2 ) & (1 < 0 < 2;a,b > 0). 


Determine the critical exponents /), y , and <5 of this model, and check that they obey the scaling 
relation (12.10.22). 

12.24. Assuming that the correlation function g(ri,rp is afunction only ofthe distance r= ry - r,|, show 
that g(r) for r y 0 satisfies the differential equation 


d 2 g 

dr 2 


d- 1 dg 
r dr 



Check that expression (12.11.27) for g(r) satisfies this equation in the regime r»(, while 
expression (12.11.28) does so in the regime r §. 

12.25. Consider the correlation function g(r; f, h) of Section 12.11 with h > 0. 22 Assume that this function 
has the following behavior: 


g(r) ~ e r /?0,U x S ome power of r (t > 0), 

such that §(0,h) ~ h~ l ’ c . Show that v c = v/A. 

Next, assume that the susceptibility / (0, h) ~ h~ Y<: . Show that y c = y/A = (8 — 1 )/8. 


22 For more details, see Tarko and Fisher (1975). 



Problems 469 


12.26. Liquid He 4 undergoes a superfluid transition at T ~ 2.17 K. The order parameter in this case is a 
complex number T, which is related to the Bose condensate density p 0 as 

po ~ I'Ll 2 ~ |f| 2/s (f<0). 

The superfluid density p St on the other hand, behaves as 

P,~\t\ v (f<0). 

Show that the ratio 23 


(Po/Ps)~\tr (f< 0). 

12.27. The surface tension, a, of a liquid approaches zero as T -> T c from below. Define an exponent /x 
by writing 

cr ~ |t|^ (f<0). 

Identifying a with the “free energy associated with a unit area of the liquid-vapor interface,” argue 
that p,= (d — l)v = (2 — a)(d— l)/d. 

[Note: Analysis of the experimental data on surface tension yields: p. = 1.27 ± 0.02, which 
agrees with the fact that for most fluids a ~ 0.1.] 


23 The corresponding ratio for a magnetic system is M^/V, where Mo is the spontaneous magnetization and F the 
helicity modulus of the system; for details, see Fisher, Barber, and Jasnow (1973). 



Phase Transitions: Exact (or Almost 
Exact) Results for Various Models 


In the preceding chapter we saw that the onset of a phase transition in a given physico¬ 
chemical system is characterized by (singular) features whose qualitative nature is deter¬ 
mined by the universality class to which the system belongs. In this chapter we propose 
to consider a variety of model systems belonging to different universality classes and ana¬ 
lyze them theoretically to find out how these features arise and how they vary from class to 
class. 

In this context, we recall that the parameters distinguishing one universality class from 
another are: (i) the space dimensionality d, (ii) the dimensionality of the order parameter, 
often referred to as the spin dimensionality, n, and (iii) the range of the microscopic inter¬ 
actions. As regards the latter, unless a statement is made to the contrary, we shall assume a 
short-range interaction which, in most cases, will be of the nearest-neighbor type; the only 
parameters open for selection will then be d and n. 

We start our analysis with the properties of one-dimensional fluids, the one¬ 
dimensional Ising {n = 1) model, and the general n-vector models (again in one dimen¬ 
sion). We then return to the Ising model — this time in two dimensions — and follow 
it with a study of two models in general d but with n —»• oo. These studies will give us a 
fairly good idea as to what to expect in the most practical, three-dimensional situations 
for which, unfortunately, we have no exact solutions — though a variety of mathematical 
techniques have been developed to obtain almost exact results in many cases of interest. 
For completeness, these and other results of physical importance will be revisited in the 
last section of this chapter. 


13.1 One-dimensional fluid models 

A system of interacting particles in one dimension can be solved analytically for sev¬ 
eral cases. In particular, a system of hard spheres in one dimension can be solved in the 
canonical ensemble (Tonks, 1936) while the one-dimensional isobaric ensemble allows 
the solution of a more general set of nearest-neighbor interactions. None of these mod¬ 
els exhibits a transition to an ordered phase but they do display many of the short-range 
correlation properties characteristic of fluids. 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00013-X 
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13.1.A Hard spheres on a ring 

The partition function of a one-dimensional system of hard spheres was first evaluated by 
Tonks (1936). For this we consider the case of N hard spheres with diameter D on a ring 
of circumference L, obeying periodic boundary conditions. The free energy, the pressure, 
and the pair correlation function of this system can be determined exactly both for finite 
N and in the thermodynamic limit. The hard sphere pair interaction is given by 


u(r) = 



for r < D 
for r > D. 


( 1 ) 


The configurational partition function can be written as an integral over the spatially 
ordered positions of the N spheres, with particle 1 set at x\ — 0, while the other particles 
labeled) — 2,3,...,N are restricted by the conditions Xj_ l + D < Xj < L — (N —j + 1)D: 

L-(N-l)D L—(N—2)D L-D >7 , 

L f f f L(L-ND ) n ~ 1 

Z n (L) = — / dx 2 I dx 3 - I dx N =-—-; (2) 

D x 2 +D x N _i +D 

the prefactor L here comes from integrating xi over the circumference of the ring, while the 
factor 1 /N delabels that final integral. The Helmholtz free energy in the thermodynamic 
limit turns out to be 


A(N,L, T) = —NkTln 


L-ND\ 
NX ) 


— NkT, 


(3) 


where X is the thermal deBroglie wavelength. The pressure is then given by 


P _/“) (4) 

\ 3 L)j N 1 — nD 

where n{—N/L) is the one-dimensional number density. This is equivalent to the pressure 
of an ideal gas of N particles in a free volume L — ND. The isothermal compressibility is 




(1 — nD) 2 
nkT 


= /cl deal (l - nD) 2 


(5) 


The pair correlation function for the particles on the ring can also be determined exactly 
(M. Foss-Feig, unpublished). This is accomplished by integrating over all configurations in 
which particle 1 is fixed at the origin and, in succession, each of the other particles is fixed 
at position x (if possible). This gives 


N -1 

8(x) = J2 SjW- 

i= i 


( 6 ) 
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FIGURE 13.1 The pair correlation function for a system of 12 hard disks on a ring. The scaled number density 
nD = 0.75. The solid line represents the exact solution from equations (6) and (7), as compared to a Monte Carlo 
simulation. 


where gj(x) is defined on the range jD < x < L — (N - j)D by 

/ UN-1)1 \ / (x - jD)i- 1 (L-x-(N- fiDy**- 1 -] \ 

8j(X \N(L - ND) N_1 (/ - 1)! (N- 1 —j)\ ) ’ 

see Figure 13.1. In the thermodynamic limit, the correlation function becomes 

OO 

g(x) = J2SJ (X) ’ 

7=1 

where gj(x) is defined on the range jD < x < oc by 

(fiP(x-jD))i- 1 exp(-/SP(x- jD)) 


gj(x) = 


(1 - nD)(j - 1)! 


(7) 


( 8 ) 


0) 


where f3P= n/( 1 — riD ); see Sells, Harris, and Guth (1953). Using equation (10.7.12) gives 
the correct virial equation of state pressure 


if- 1+nDstD+> -r^D ; 

where g(D + ) is the pair correlation function at contact; see Problem 13.28. 


( 10 ) 


13.1.B Isobaric ensemble of a one-dimensional fluid 

Takahashi (1942) has shown that a one-dimensional system of particles that interact with 
nearest neighbors via pair potential u(r ) can be analyzed analytically using an isobaric 
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ensemble. The potential energy of the system can be written as 

iV+l 

U = u( - x j ~ x j- 1 ), (ID 

t=i 

where the left and right walls are treated as particles fixed at Xo = 0 and xjv+i = L. The 
isobaric partition function is given by 


OO 


Y n (P,T) = 


J exp (~pPL)Q N (L,T)dL, 


( 12 ) 


o 

where X is the thermal deBroglie wavelength. Equation (12) can be factorized in terms of 
integrals over the distances between nearest neighbors y,- (= x,- - x,_i): 


cx_> 

U' 


N +1 


Y n (P, T) = - / exp (~pPy - p u(y))dy | = Fi (p, T) 


iV+l 


(13) 


The bulk Gibbs free energy is then given by 


G(N,P, T ) = —NkT\n(Yi(P, T)), 


(14) 


and the average system size at pressure P by 

(15 > 

where 


. . = /o°° yexp[-pPy-pu(y)]dy 
f™ exp[-pPy-pu(y)]dy 

is the average nearest-neighbor distance between the particles. The isothermal compress¬ 
ibility, 


L \ dP J j n 


N 

kJL 


(<y 2 ) - <y) 2 ), 


(17) 


is proportional to the variance of the nearest-neighbor distances. It is now easy to show 
that one-dimensional models cannot form a long-range ordered lattice. The average dis¬ 
tance between two particles labeled by i and j is (x, - xj) = (i - j)(y) but the variance 
is given by ((x ; - -xy) 2 ) = \i- j\((y 2 ) - (y> 2 ). Therefore, if a chosen particle is located on a 
particular lattice site, then a particle m sites away will on average be separated from it by 
m lattice spacings, but the variance of this particle’s position from that site location grows 
linearly with m. 
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In the thermodynamic limit, the structure factor S(k), equation (10.7.18), can be written 
in the form 


S(k) = 



Since (xj - xo) = y,-, the structure factor can be summed exactly to give 


(18) 


00 00 2 

s ( fc)=-t++E w 

7=0 7=0 


- + \Z\ 2 -Z-Z* 


where 


z = (e iky ) = /o°° ex P [-PPy-P u(y) + iky] dy 
' I exp[-/3Py-pu(y)]dy 


(19) 


( 20 ) 


and z* is the complex conjugate of z. The fluctuation-compressibility relation now gives: 
S(k -»■ 0) = KT/K l j eal = ((y 2 ) - (y) 2 ) / (y> 2 . For the particular case of hard spheres, the pres¬ 
sure is given by equation (4) and the structure factor is given by 


S(k) = 


(kD) 2 

(kD) 2 + 2(/SPD) 2 (l - cos(fcD)) + 2 (pPD)(kD) sin (kD) ’ 


( 21 ) 


see Figure 13.2. Equation (10.7.20a), applied to the hard sphere pair correlation function 
in equation (9), also gives equation (21). 



FIGURE 13.2 The structure factor S(k) for a system of hard spheres on a line at density nD= 0.75. The structure 
factor at k = 0 is S(0) = (1 — nD) 2 = »rj-//c^ eal . 
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13.2 The Ising model in one dimension 


In this section we present an exact treatment of the Ising model in one dimension. This is 
important for several reasons. First of all, there do exist phenomena, such as adsorption 
on a linear polymer or on a protein chain, the elastic properties of fibrous proteins, and so 
on, that can be looked upon as one-dimensional nearest-neighbor problems. Secondly, it 
helps us evolve mathematical techniques for treating lattices in higher dimensions, which 
is essential for understanding the critical behavior of a variety of physical systems met 
with in nature. Thirdly, it enables us to estimate the status of the Bethe approximation as 
a “possible” theory of the Ising model, for it demonstrates mathematically that, at least in 
one dimension, this approximation leads to exact results. 

In a short paper published in 1925, Ising himself gave an exact solution to this prob¬ 
lem in one dimension. He employed a combinatorial approach that has by now been 
superseded by other approaches. Here we shall follow the transfer matrix method, first 
introduced by Kramers and Wannier (1941). In the one-dimensional case, this method 
worked with immediate success. Three years later, in 1944, it became, through Onsager’s 
ingenuity, the first method to treat successfully the field-free Ising model in two dimen¬ 
sions. To apply this method, we replace the actual lattice by one having the topology of 
a closed, endless structure; thus, in the one-dimensional case we replace the straight, 
open chain by a curved one such that the IVth spin becomes a neighbor of the first 
(see Figure 13.3). This replacement eliminates the inconvenient end effects; it does not, 
however, alter the thermodynamic properties of the (infinitely long) chain. The impor¬ 
tant advantage of this replacement is that it enables us to write the Hamiltonian of the 
system, 


N 


H N {<Ti} = -jJ2 


ttB a i> 


( 1 ) 


n.n. 


i=l 


in a symmetrical form, namely 



( 2 ) 


/ 


\ 


N— 3 


N- 



FIGURE 13.3 An Ising chain with a closed, endless structure. 
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because crjv+i = <n • The partition function of the system is then given by 

r n ! i 

qn(b,t) = 53 ••• 53 ex p pJ2v aiai + i + 2 tlB( ' ai+ai+l ^ 

eri=±l cr^/=± 1 L 1—1 

cr\ =± 1 otv=± 1 


where P denotes an operator with matrix elements 


that is, 


(ai\P\a i+ i) = exp 


P {M-ffi+i + -iiB{oi + a i+1 ) 


\ e~V 


e-PJ 

e pa-iiB) 


(3a) 

(3b) 


(4) 


According to the rules of matrix algebra, the summations over the various a/ in equa¬ 
tion (3b) lead to the simple result 

Qn(B, T) — J2 <<r 1 |i >A Vi)= Trace (P N ) = ^ + x%, (5) 

<7l=±l 

where and >. 2 are the eigenvalues of the matrix P. These eigenvalues are given by the 
equation 


e -PJ e pd-tiB) _ ^ 


( 6 ) 


that is, by 


X 2 — 2Ae^cosh(/S fiB) + 2sinh(2/i/) = 0. 


(7) 


One readily obtains 


= eP J cosh(P/iB) ± [e 2 ^ + e 2 ^sinh 2 ( ( 8/r5)} 1//2 . (8) 

Quite generally, /. 2 < X ]; so, (a 2 //-i ) n —► 0 as N oo. Thus, it is only the larger eigenvalue, 
A.i, that determines the major physical properties of the system in the thermodynamic 
limit; see equation (5] . It follows that 


(9) 


^lnQjv(B,T) ^lnli 

= ln[e^cosh(/i/rB) + ( e~ 2 ^ + e 2 ^ sin^iPfiB)} 1 ^ 2 ]. 


( 10 ) 
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The Helmholtz free energy then turns out to be 

A(B, T) — —NJ — NkT In [cosh(/t pB) + [e~ 4 ^ + sinh 2 (fipB )) b 2 ]. 


( 11 ) 


The various other properties of the system follow readily from equation (11). Thus, 



2 NJe-W 


( 12 ) 


[cosh(/i/M.B) + (e -4 ^ + sinh 2 (flpB)} l l 2 ]{e~ i Pl + sinh 2 (yS/ui ?)} 1 / 2 


from which the specific heat can be derived, and 



(13) 


from which the susceptibility can be derived. 

Right away we note that, as B -»• 0,M (for all finite f J >) ->• 0. This rules out the possibility 
of spontaneous magnetization, and hence of a phase transition, at any finite temperature 
T. Of course, at T — 0, M (for any value of B) is equal to the saturation value Np, which 
implies perfect order in the system. This means that there is, after all, a phase transition at 
a critical temperature T c , which coincides with absolute zero! 

Figure 13.4 shows the degree of magnetization, M, of the lattice as a function of the 
parameter (figB) for different values of (fij). For ] — 0, we have the paramagnetic result 
M — Nptanh(/3pB); compare to equation (3.9.27). A positive J enhances magnetization 
and, in turn, leads to a faster approach toward saturation. As /3J -+ oo, the magnetiza¬ 
tion curve becomes a step function — indicative of a singularity at T = 0. The low-field 
susceptibility of the system is given by the initial slope of the magnetization curve; one 
obtains 



(14) 


M(B, T)/N/j. 





1 


> (P/J.B) 


FIGURE 13.4 The degree of magnetization of an Ising chain as a function of the parameter (P 11 B). 
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which diverges as T 0. It should be noted that the singularity here is not of the power-law 

type; it is exponential instead. 

The zero-field energy and the zero-field specific heat of the system follow from equa¬ 
tion (12); one gets 


U 0 (T) = -Njtanh(pj) 


(15) 


and 

C 0 (T) = Nk(fiJ) 2 sech 2 (/3J). (16) 

Figure 13.5 shows the variation of the specific heat Co as a function of temperature. 
Although it passes through a maximum, Co is a smooth function of T, vanishing as T -* 0. 
Note that equations (15) and (16) are identical to the corresponding equations, (12.6.29) 
and (12.6.30), of the Bethe approximation, with coordination number 2, for which T c = 0. 
It turns out that for a one-dimensional chain the Bethe approximation, in fact, yields exact 
results; for a fuller demonstration of this, see Problem 13.3. 

At this stage it seems instructive to express the free energy of the system, near its crit¬ 
ical point, in a scaled form, as in Section 12.10. Unfortunately, there is a problem here. 
Since T c = 0, the conventional definition, t — (T — T c )/T c , does not work. A closer look at 
equations (11) and (14), however, suggests that we may adopt instead the definition 

t = e-P J/kT (p > 0) (17) 

so that, as T -»• T c , t -»■ 0 as desired while for temperatures close to T c , t is much less than 
unity. The definition of h remains the same, namely pB/kT. The free energy function 
(A + NJ) /NkT then takes the form 

\// (s \t,h) = — ln[cosh/z + (f 4/,p + sinh 2 h) 1 ^ 2 ] (18a) 

ss -(t 4/p + /z 2 ) 1/2 (t,h« 1), (18b) 

which may be written in the scaled form 

is (s \t,h)^t 2/p f(h/t 2/ P). (19) 



FIGURE 13.5 The zero-field specific heat of an Ising chain as a function of temperature. 
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At the same time, equation (14) becomes 

xocn _ 2 ip 

(Nfi 2 /kT) ~ 


( 20 ) 


Comparing these results with the scaling formulation of Section 12.10, we infer that for this 
model 


a = 2 — 2/p, A = 2 /p, and y=2/p, (21) 

in conformity with the exponent relation (12.10.14b). Note that the exponents /J and 8 for 
this model cannot be defined in the normal, conventional sense. One may, however, write 
equation (13) in the form 


m = sinh/z/(t 4/,p + sinh 2 h) 1 ^ 2 (22a) 

S3 h/(t 4/p + h 2 ) 1/2 (.t , h « 1) (22b) 

= -t°f'(h/t 2/ P), (22c) 

suggesting that ft may formally be taken as zero. At the same time, since 

m\ t=0 = 1, (23) 

which is ~ h°, the exponent 8 may formally be taken as infinite. 

We now study spin-spin correlations in the Ising chain. For this, we set B = 0 but 
at the same time generalize the interaction parameter J to become site-dependent (the 
reason for which will become clear soon). The partition function of the system is then 
given by 


q N (T)= J2 ■■■ E n« w ° Wfl ; (24) 

cr\ =± 1 <Xjv=± 1 i 

compare to equation (3a). With B — 0, it is simpler to work with an open chain, which has 
only (N - 1) nearest-neighbor pairs; the advantage of this choice is that in the summand 
of (24) the variables o\ and on appear only once! A summation over either of these can be 
carried out easily; doing this over on, we have 

J2 e pjN -^ N -^ N = 2cosh(/3/jv-io-JV-i) = 2cosh(/?/jv_i), (25) 

CTjV=±l 


regardless of the sign of on- i. We thus obtain the recurrence relation 


Qn(T;Ji,. . -,Jn-i) = 2cosh(/)/jv_i)Qw_i(T;/i,... Jn-2)- 


(26) 
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By iteration, we get 


N -1 N -1 

Qn(T) = ]""[ {2cosh(/6/0} e coshf/S 

i= 1 <7i=±l i=l 


(27) 


so that 


^ 1 ^_* 

— In Q n (T) = ln2 + — lncosh00/;), (28) 

i=i 

which may be compared with equation (9) — remembering that, in the absence of the 
field, 7.] = 2 coshf/S/). We are now ready to calculate the correlation function, g(r), of the 
Ising chain. 

It is straightforward to see from equation (24) that 

C9) 

Substituting from equation (28), and remembering that W k = 0 at all finite temperatures, 
we obtain for the nearest-neighbor correlation function 


gkin.n.) = a k a k+ i = tanh(^/ fc ). 


For a pair of spins separated by r lattice constants, we get 

gk(r) = a k a k+r = {o k 0 k+1 ){o k+l o k+2 ) ■ ■ ■ (fffc+r-ifffc+r) (since all of = 1) 

k+r-l 

= Y[ tanhf pjj). 
i—k 

Reverting to a common /, we obtain the desired result 

g(r) = tanh r (/S/), 

which may be written in the standard form 

g(r) = e~ r if with £ = [lncoth(/6/)] _1 . 


For /6/» 1, 



(30) 


(31) 


(32) 


(33a, b) 


(34) 
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which diverges as T —s 0. In terms of the variable t, as defined in equation (17), 

f ~ t~ 2/p (f« 1), (35) 

giving v — 2/p. And since our g(r) does not contain any power of r, we infer that 
(d - 2 + rj) — 0 — giving q — 1; one may check that the same result follows from equation 
(12.12.10) or (12.12.11). In passing, we note that, regardless of the choice of the number p 
in defining t, we have for this model 


y = v = 2 — a. (36) 

We further note that, since d = 1 here, the hyperscaling relation, dv = 2 — a, is also obeyed. 
Finally, we observe that expression (33b) for % is in conformity with the general result 

? _1 = ln(MA 2 )> (37) 

where X\ is the largest eigenvalue of the transfer matrix P of the problem and X 2 the 
next largest; for a derivation of this result, see Section 5.3 of Yeomans (1992). In our case, 
M = 2cosh (fij) and X 2 = 2sinh(/i/), see equation (8) with B — 0, and hence expression 
(33b) for £. 


13.3 The n-vector models in one dimension 

We now consider a generalization of the Ising chain in which the spin variable a / is an 
//-dimensional vector of magnitude unity, whose components can vary continuously over 
the range —1 to +1; in contrast, the Ising spin cq could have only a discrete value, +1 or —1. 
We shall see that the vector models (with n > 2), while differing quantitatively from one 
another, differ rather qualitatively from the scalar models (for which n— 1). While some of 
these qualitative differences will show up in the present study, more will become evident 
in higher dimensions. Here we follow a treatment due to Stanley (1969a,b) who first solved 
this problem for general n. 

Once again we employ an open chain composed of N spins constituting (IV—1) 
nearest-neighbor pairs. The Hamiltonian of the system, in zero field, is given by 


IV-1 

H N Wi\ = - £//a, ■ a i+ 1 . (1) 

i= l 

We assume our spins to be classical, so we do not have to worry about the commutation 
properties of their components. And since the components oi a (a = 1,..., n) of each spin 
vector <t/ are now continuous variables, the partition function of the system will involve 
integrations, rather than summations, over these variables. Associating equal a priori 
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probabilities with solid angles of equal magnitude in the n -dimensional spin-vector space, 
we may write 


e i>Ji°i°i+ i, (2 ) 

J Q(n) Q.(n) 

where Q(n) is the total solid angle in an n -dimensional space; see equation (7b) of 
Appendix C, which gives 


Q(n) = 2n nl2 /r(n/2). (3) 

We first carry out integration over on, keeping the other <r, fixed. The relevant integral to 
do is 

f e P ,N - iaN - l ' aN dQ. N . (4) 

Q(n) J 

For <r N we employ spherical polar coordinates, with polar axis in the direction of ffjv-i, 
while for dQ.^ we use expression (9) of Appendix C. Integration over angles other than the 
polar angle 9 yields a factor of 


2jt ( "“ 1)/2 /T{(«- l)/2). 


(5) 


The integral over the polar angle is 


[ e^ cose sm n - 2 ede = Jrl/2r{( ” 1)/2} I(n-2)/2WN-l), 

J dpj N - 1) ( «-2)/ 2 w 

n z 


( 6 ) 


where I/, (x) is a modified Bessel function; see Abramowitz and Stegun (1964), formula 
9.6.18. Combining (3), (5), and (6), we obtain for (4) the expression 


V(n/ 2) 

regardless of the direction of on- i- By iteration, we get 


(7) 


N -1 


Qiv= n 


T{n/2) 


U (z PJ0 (n ~ 2)/2 


I(n—2)/2) (PJi)> 


( 8 ) 


the last integration, over dQ\, gave simply a factor of unity. 

Expression (8) is valid for all n — including n — 1, for which it gives: Qn = 11/ cosh (/J/;). 
This last result differs from expression (13.2.27) by a factor of 2 N ; the reason for this dif¬ 
ference lies in the fact that the Qn of the present study is normalized to go to unity as the 
PJi go to zero [see equation (2)] whereas the Q\; of the preceding section, being a sum over 
2 n discrete states [see equation (13.2.24)] goes to 2 N instead. This difference is important 
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in the evaluation of the entropy of the system; it is of no consequence for the calculations 
that follow. 

First of all we observe that the partition function Q,y is analytic at all ft — except 
possibly at ft = oo where the singularity of the problem is expected to lie. Thus, no long- 
range order is expected to appear at any finite temperature T — except at T — 0 where, 
of course, perfect order is supposed to prevail. In view of this, the correlation function for 
the nearest-neighbor pair (<r k ,(T k+ i) is simply aft, • cr k+1 and is given by, see equations (2) 
and (8), 


gk(n-n-) 


_L/1_9_\ In/2(ftJk) 
QN\ftdJk) N hn-2)/2(ftJk) 


The internal energy of the system turns out to be 


U 0 


d 

dft 


N -1 

(InQ/v) = -J2 Ji 
l=i 


In/2(PJi) 

ftn-2)/2(ftJi) 


0) 


( 10 ) 


not surprisingly, Uo is simply a sum of the expectation values of the nearest-neighbor 
interaction terms —//<r/ ■ oq+i, which is identical to a sum of the quantities —Jigfn.n.) over 
all nearest-neighbor pairs in the system. 

The calculation of g k (r) is somewhat tricky because of the vector character of the spins, 
but things are simplified by the fact that we are dealing with a one-dimensional system 
only. Let us consider the trio of spins <r k ,cr k+ i and a k+2 , and suppose for a moment that our 
spins are three-dimensional vectors; our aim is to evaluate a k ■ ar k+2 . We choose spherical 
polar coordinates with polar axis in the direction of a k+ p, let the direction of a k be defined 
by the angles (6 0 ,(p 0 ) and that of a k+2 by (0 2 ,4>2)- Then 


a k • a k +2 = cos9(k,k + 2) = cos0 O cos 02 + sin 0 osin 02 cos( 0 o — <fe)- (11) 


Now, with ar k+ i fixed, spins a k and a k+2 will orient themselves independently of one 
another because, apart from a k+ i, there is no other channel of interaction between them. 
Thus, the pairs of angles (6o,<Po) and {(h,(h ) vary independently of one another; this makes 
cos(</>o — 0 2 ) = 0 and cosdo cos $2 = cos(Oo) cos( 02 )- It follows that 


fffc • <r k +2 = <*k ' ff/t+l <rfc+l • ak+ 2 - 


( 12 ) 


Extending this argument to general n and to a segment of length r, we get 


k+r -1 k+r -1 

gk(r)= ]~[ g,:(n.n.)= [~[ I n / 2 (ftJi)/I(n- 2 )/ 2 (ft>Ji)- 
i=k i—k 


(13) 
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With a common /, equations (9), (10), and (13) take the form 

(14) 

(15) 


g(n.n.) = In/2(PJ)/kn-2)/2(fiJ), 

Uo = ~(N - l)JI n /2(PJ)/I(n-2)/2(PJ) 


and 


g(r) = {I n /2 (flJ) /I(n— 2)/2 (PJ) } r • (16) 

The last result here may be written in the standard form e~ r ^, with 

§ = [ln{/ ( „_ 2) /2 (/S/)/7„/2 (/37)}] 1 - (17) 

For n— 1, we have: /i/ 2 (x)//-i/ 2 (x) = tanhx; the results of the preceding section are then 
correctly recovered. 

For a study of the low-temperature behavior, where /3J » 1, we invoke the asymptotic 
expansion 


f, t (x) = 


e* 

V (2 TCX) 



(X » 1 ), 


with the result that 


g(n.n.) 


Uo 


n — 1 

1-, 

2 PJ 


-(IV-1)/ 



n— 1" 

~2PT_ 


(18) 


(14a) 

(15a) 


and 


$ 


2 pj 
n—l 


T -i 


(17a) 


Clearly, the foregoing results hold only for n > 2; for n = 1, the asymptotic expansion (18) 
is no good because it yields the same result for q, = \ as for \i — - In that case one is 
obliged to use the closed form result, g{n.n.) = tanh(/)/), which for /3J » 1 gives 


g(n.n.) ~ 1 - 2e~ m 


(14b) 


Uo « —(N— 1)/[1 — 2e~ 2 P J ] 


(15b) 


and 





(17b) 
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in complete agreement with the results of the preceding section. For completeness, we 
have for the low-temperature specific heat of the system 

I -(n—l)k for n> 2 (19a) 

2 

4k(fiJ) 2 e~ 2pj for n — 1. (19b) 

The most obvious distinction between one-dimensional models with continuous sym¬ 
metry (n > 2) and those with discrete symmetry (n — 1) is in relation to the nature of the 
singularity at T — 0. While in the case of the former it is a power-law singularity, with 
critical exponents 1 


a = 1, v = 1 , >7 = 1, and hence y = 1 , (20) 

in the case of the latter it is an exponential singularity. Nevertheless, by introducing the 
temperature parameter t — e~ p ^ J , see equation (13.2.17), we converted this exponential 
singularity in T into a power-law singularity in t, with 


a = 2 — 2/p, y = v = 2/p, > 7 = 1 . 


( 21 ) 


However, the inherent arbitrariness in the choice of the number p left an ambiguity in the 
values of these exponents; we now see that by choosing p = 2 we can bring exponents (21) 
in line with (20). 

Next we observe that the critical exponents (20) for n > 2 turn out to be independent of 
n — a feature that seems peculiar to situations where T c = 0. In higher dimensions, where 
T c is finite, the critical exponents do vary with n; for details, see Section 13.7. In any case, 
the amplitudes always depend on n. In this connection we note that, since each of the 
N spins comprising the system has n components, the total number of degrees of free¬ 
dom in this problem is Nn. It seems appropriate that the extensive quantities, such as Uq 
and Co, be divided by Nn, so that they are expressed as per degree of freedom. A look at 
equation (15a) then tells us that our parameter J must be of the form nf, so that in the 
thermodynamic limit 


U o 
Nn 


-J' + 


n— 1 
2 n 


kT 


( 22 ) 


and, accordingly, 


Cp _ n- 1 
Nn 2 n 


(23) 


'Note that, since T c = Ohere, the assertion that the free energy function ft = ( A/NkT) ~ (T — T c ) 2 ~ a implies that, near 
T = T c in the present case, A ~ T 3 ~ a ; accordingly, the specific heat Co ~ T 2 ~ a . Comparison with expression (19a) would 
be inappropriate because Co, in the limit T -+ 0, cannot be nonzero; the result quoted in (19a) is an artifice of the model 
considered, which must somehow be “subtracted away.” The next approximation yields a result proportional to the first 
power in T, giving a = 1. For a parallel situation, see equation (13.5.35) for the spherical model (which pertains to the 
case > 2 = oo). 
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Equation (17a) then becomes 


2 n /' 
n — 1 kT' 


(24) 


Note that the amplitudes appearing in equations (22) through (24) are such that the limit 
n -*■ oo exists; this limit pertains to the so-called spherical model, which will be studied in 
Section 13.5. 

Figure 13.6 shows the normalized energy u 0 (= Uq/NuJ') as a function of temperature 
for several values of n, including the limiting case n = 1. We note that u 0 (which, in the 
thermodynamic limit, is equal and opposite to the nearest-neighbor correlation function 
g(n.n.)) increases monotonically with n — implying that g(n.n.), and hence g(r), decrease 
monotonically as n increases. This is consistent with the fact that the correlation length f 
also decreases as n increases; see equation (24). The physical reason for this behavior is 
that an increase in the number of degrees of freedom available to each spin in the system 
effectively diminishes the degree of correlation among any two of them. 

Another feature emerges here that distinguishes vector models (ri > 2) from the scalar 
model (n = 1); this is related to the manner in which the quantity Uo approaches its 
ground-state value — 1. While for n — 1, the approach is quite slow—leading to a vanishing 



FIGURE 13.6 The normalized energy «o(= Ua/NnJ') of a one-dimensional chain as a function of the temperature 
parameter kT/]' for several values of n (after Stanley, 1969a,b). Note that for this classical model, the slopes as 
0 are given by the equipartition theorem for n- 1 degrees of freedom per spin. 
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specific heat, see equations (15b) and (19b) — for n > 2, the approach is essentially linear 
in T, leading to a. finite specific heat; see equations (15a) and (19a). This last result violates 
the third law of thermodynamics, according to which the specific heat of a real system 
must go to zero as T -*■ 0. The resolution of this dilemma lies in the fact that the low-lying 
states of a system with continuous symmetry (n > 2) are dominated by long-wavelength 
excitations, known as Goldstone modes, which in the case of a magnetic system assume 
the form of “spin waves,” characterized by a particle-like spectrum: co(k) ~ k 2 . The very 
low-temperature behavior of the system is primarily governed by these modes, and the 
thermal energy associated with them is given by 


^thermal 


7 


hco 


exp(hw / k^T) — 1 


k d ~ l dk ~ T (d+2 h 2 ; 


(25) 


this results in a specific heat— T d/2 , which indeed is consistent with the third law. For 
a general account of the Goldstone excitations, see Huang (1987); for their role as “spin 
waves” in a magnetic system, see Plischke and Bergersen (1989). 

For further information on one-dimensional models, see Lieb and Mattis (1966) and 
Thompson (1972a,b). 


13.4 The Ising model in two dimensions 

As stated earlier, Ising (1925) himself carried out a combinatorial analysis of the one¬ 
dimensional model and found that there was no phase transition at a finite temperature T. 
This led him to conclude, erroneously though, that his model would not exhibit a phase 
transition in higher dimensions either. In fact, it was this “supposed failure” of the Ising 
model that motivated Heisenberg to develop, in 1928, the theory of ferromagnetism based 
on a more sophisticated interaction among the spins; compare the Heisenberg interac¬ 
tion (12.3.6) with the Ising interaction (12.3.7). It was only after some exploitation of the 
Heisenberg model that people returned to investigate the properties of the Ising model. 

The first exact, quantitative result for the two-dimensional Ising model was obtained 
by Kramers and Wannier (1941) who successfully located the critical temperature of the 
system. They were followed by Onsager (1944) who derived an explicit expression for the 
free energy in zero field and thereby established the precise nature of the specific-heat 
singularity. These authors employed the transfer matrix method that was introduced in 
Section 13.2 to solve the corresponding one-dimensional problem; its application to the 
two-dimensional model, even in the absence of the field, turned out to be an extremely 
difficult task. Although some of these difficulties were softened by subsequent treatments 
due to Kaufman (1949) and to Kaufman and Onsager (1949), it seemed very natural to look 
for other simpler approaches. 

One such approach was developed by Kac and Ward (1952), later refined by Potts and 
Ward (1955), in which combinatorial arguments were used to express the partition func¬ 
tion of the system as the determinant of a certain matrix A. This method throws special 
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light on the “topological conditions” that give rise to an exact solution in two dimensions 
but are clearly absent in three dimensions; a particularly lucid account of this method has 
been given by Baker (1990). In 1960 Hurst and Green introduced yet another approach 
to investigate the Ising problem; this involved the use of “triangular arrays of quantities 
closely related to antisymmetric determinants” and became rightly known as the method 
of Pfaffrans. This method applies rather naturally to the study of “configurations of dimer 
molecules on a given lattice” which, in turn, is closely related to the Ising problem; for 
details, see Kasteleyn (1963), Montroll (1964), and Green and Hurst (1964). A pedagog¬ 
ical account of the approach through Pfaffians is given in Thompson (1972b), where a 
comprehensive treatment of the original, algebraic approach can also be found. Another 
combinatorial solution, which is generally regarded as the simplest, was obtained by 
Vdovichenko (1965) and by Glasser (1970), and is readily accessible in Stanley (1971). For 
an exhaustive account of the two-dimensional Ising model, see McCoy and Wu (1973). 

We analyze this problem with the help of a combinatorial approach assisted, from time 
to time, by a graphical representation. The zero-field partition function of the system is 
given by the familiar expression 

Q(N, T) = EIK* 9 (K =J/kT). (1) 

tml n - n - 

Our first step consists of carrying out a high-temperature and a low-temperature expan¬ 
sion of the partition function and establishing an intimate relation between the two. 

High-temperature expansion 

Since the product (oyoy) can only be +1 or —1, we may write 

e Ka i a i — cosh^f + oyoy sinhiC = cosh_KT(l + ayoyhf v = tanh^T. (2) 

The product over all nearest-neighbor pairs then takes the form 

n eKa ‘ ai = (cosh K)-* n (1 + trtajv), (3) 

n.n. n.n. 

jV being the total number of nearest-neighbor pairs on the lattice; for a lattice with peri¬ 
odic boundary conditions, qN where q is the coordination number. The partition 

function may then be written as 


Q(N, T) = (cosh Ky /V J2 J2 

< 71=±1 cr N =±\ 


i+vj2°i°j+v 2 J2Y, a i a j a k a l + • ■ • 


«J) 


(i<7) (k,ti 


(4) 


Now we represent each product (ayay) appearing in (4) by a “bond connecting sites i and 
j on the given lattice”; then, each coefficient of v r in the expansion would be represented 
by a “graph consisting of r different bonds on the lattice.” Figure 13.7 shows all possible 
graphs, with r = 1 and 2, on a square lattice. Notice that in each case we have some of 
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FIGURE 13.7 Graphs with r— 1 and r = 2 on a square lattice. Graph (i) is for r = 1. The r = 2 graphs are of two types, 
ones that do not include a common site as in (ii)a and ones that do include a common site as in (ii)b. 



FIGURE 13.8 Examples of closed graphs with r bonds on a square lattice. 


the oi appearing only once in the term, which makes all these terms vanish on summation 
over {ct,;}. The same is true for r = 3. Only when we reach r = 4 do we receive a nonvanish¬ 
ing contribution from terms of the type {oiOjOjo^a^aiaiaO = 1 which, on summation over 
{a/}, yield a contribution of 2 N each. It is obvious that a nonvanishing term corresponds 
to a graph in which each vertex is met by an even number of bonds — making the graph 
necessarily a closed one; see Figure 13.8, where some other closed graphs are also shown. 
In view of these observations, expression (4) may be written as 

Q(N, T) = 2 W (coshJTT 4 n(r)v r [n(0) = l], (5) 

r 

where n{r) is the number of graphs that can be drawn on the given lattice using r bonds 
such that each vertex of the graph is met by an even number of bonds. For simplic¬ 
ity, we shall refer to these graphs as closed graphs. Our problem thus reduces to one of 
enumerating such graphs on the given lattice. 

Since v — tanh (J/kT), the higher the temperature the smaller the v. Expansion (5) is, 
therefore, particularly useful at higher temperatures (even though it is exact for all T ). As 
an illustration, we apply this result to a one-dimensional Ising chain. In the case of an 
open chain, no closed graphs are possible, so all we get from (5) is the term with r — 0; with 
Jf — N - 1, this gives 


Q(N, T) = 2 N (coshK) N ~ 1 


(6) 
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which agrees with our previous result (13.2.27). In the case of a closed chain, we do have a 
closed graph — the one with r = N;we now get (with JY = N) 

Q{N, T) = 2 JV (cosh.Ky v [l + iA] = 2 w [(cosh^) JV + (sinh^)"], (7) 

which agrees with expression (13.2.5), with (Ai)b=o = 2cosh,Rr and ( ^ 2 )b=o — 2sinhf<f. 

Low-temperature expansion 

We start with the observation that the ground state of the system consists of all spins 
aligned in the same direction, with the total energy Eq = —J,/E. As one spin is flipped, q 
unlike nearest-neighbor pairs are created at the expense of like ones, and the energy of the 
system increases by an amount 2 qj. It seems appropriate, therefore, that the Hamiltonian 
of the system be written in terms of the number, Ai + _, of unlike nearest-neighbor pairs in 
the lattice, that is, 


H(N+_) = -W++ + JV__ - AT+_) = -J{Jk - 2AT+_). 


( 8 ) 


The partition function of the system may then be written as 

Q(N, T) = e KA ^m(r)e _ZKr [m( 0) = 1], (9) 

r 

where m(r) denotes the “number of distinct ways in which the N spins of the lattice can be 
so arranged as to yield r unlike nearest-neighbor pairs.” It is obvious that the first nonzero 
term in (9), after the one with r = 0, would be the one with r = q. 

A graphical representation of the number m(r) is straightforward. Referring to 
Figure 13.9, which pertains to a square lattice, we see that each term in expansion (9) can 
be associated with a closed graph that cordons off region(s) of “up” spins from those of 
“down” spins, the perimeter of the graph being precisely the number of unlike nearest- 
neighbor pairs in the lattice for that particular configuration. Our problem then reduces 
to one of enumerating closed graphs, of appropriate perimeters, that can be drawn on the 
given lattice. 



FIGURE 13.9 Graphs cordoning off regions of "up" spins from those of "down" spins, with r unlike 
nearest-neighbor pairs. 
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Now, since expansion (9) is a power series in the variable e~ 2K that increases as T 
increases, this expansion is particularly useful at lower temperatures (even though it is 
exact for all T). We shall now establish an important relation between the coefficients 
appearing in expansion (5) and the ones appearing in (9). 

The duality transformation 

To establish the desired relation we construct a lattice dual to the one given. By definition, 
we draw right bisectors of all the bonds in the lattice, so that the points of intersection of 
these bisectors become the sites of the new lattice. The resulting lattice may not be similar 
in structure to the one we started with; for instance, while the dual of a square lattice is 
itself a square lattice, the dual of a triangular lattice (q = 6) is, in fact, a honeycomb lattice 
(q — 3), and vice versa; see Figures 13.10 and 13.11. The argument now runs as follows: 

We start with the given lattice on which one of the n(r) closed graphs, with r bonds, 
is drawn and construct the lattice dual to this one, placing spins of one sign on the sites 
inside this graph and spins of opposite sign on the sites outside. Then this graph represents 
precisely a configuration with r unlike nearest-neighbor pairs in the dual lattice and hence 
qualifies to be counted as one of the m(r ) graphs on the dual lattice. Conversely, if we start 
with one of the m(r) graphs, of perimeter r, representing a configuration with r unlike 
nearest-neighbor pairs in the original lattice and go through the process of constructing 
the dual lattice, then this graph will qualify to be one of the n(r ) closed graphs, with r 
bonds, on the dual lattice. In fact, there is a one-to-one correspondence between graphs 
of one kind on the given lattice and graphs of the other kind on the dual lattice; compare 
Figure 13.8 with Figure 13.9. It follows that 

n(r) = mod) and m(r) = no(r), (10a, b) 



FIGURE 13.10 A square lattice and its dual (which is also square). 
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where the suffix D refers to the dual lattice. 

With relations (10) established, we go back to equation (9) and introduce another 
temperature variable K*(= J/kT*) such that 


tanh K* = e~ 2K ; (11) 

note that equation (11) can also be written in the symmetrical form 

sinh(2_KT) sinh(2f*:*) = 1. (12) 

Substituting (10b) and (11) into (9), we get 

Q(N, T) = e K ' /Y n D (r)v* r , v* = tanhlC*. (13) 

r 

At the same time we apply equation (5) to the dual lattice at temperature T*, to get 

Qd(N d , T*) = 2 Jv fl(cosh K*)-* Y, n D (r)v* r , (14) 

r 

where No = qN/q o', see again Figure 13.11. Comparing (13) and (14), we arrive at the 
desired relation 


Q(N, T) = 2^ /v «(sinh /C* coshK*r A ' /2 Q D (N D , T*) 


(15) 






















494 Chapter 13 • Phase Transitions: Exact (or Almost Exact) Results for Various Models 


which relates the partition function of the given lattice at temperature T to that of the dual 
lattice at temperature T*. Equation (15) constitutes the so-called duality transformation. 

Location of the critical point 

For a square lattice, which is self-dual, there should be no distinction between Q and Qjj. 
With q — 4, and hence ,/K = 21V, equation (15) becomes 

Q(N, T) = [sinh(2^T*)] _w Q(lV, T*), (16) 


which may also be written as 

lsinh(2K)]- N/2 Q(N, T) = [sinh(2K*)r N l 2 Q(N, T*). (17) 

It will be noted from equation (11) or (12) that as T -* oo, T* —>■ 0 and as T -> 0, T* -»• 
oo; equation (17), therefore, establishes a one-to-one correspondence between the high- 
temperature and the low-temperature values of the partition function of the lattice. It then 
follows that if there exists a singularity in the partition function at a particular temperature 
T c , there must exist an equivalent singularity at the corresponding temperature T*. And in 
case we have only one singularity, as indeed follows from one of the theorems of Yang and 
Lee (1952), it must exist at a temperature T c such that T* — T c . The critical temperature of 
the square lattice is, therefore, given by the equation, see formula (12), 

sinh(2fsf c ) = 1, (18) 


which gives 


K c = ^ sinh' 1 1 = i ln(V2 + 1) = ^ lncot(jr/8) ~ 0.4407. (19) 

For comparison, we note that for the same lattice the Bragg-Williams approximation gave 
K c = 0.25 while the Bethe approximation gave K c = \ In2 ~ 0.3466. 

The situation for other lattices such as the triangular or the honeycomb, which are not 
self-dual, is complicated by the fact that the functions Q and Qd in equation (15) are not 
the same. One then needs another trick — the so-called star-triangle transformation — 
which was first alluded to by Onsager (1944) in his famous paper on the solution of the 
square lattice problem but was written down explicitly by Wannier (1945); for details, see 
Baxter (1982). Unlike the duality transformation, this one establishes a relation between 
a high-temperature model on the triangular lattice and again a high-temperature model 
on the honeycomb lattice, and so on. Combining the two transformations, one can elim¬ 
inate the dual lattice altogether and obtain a relation between a high-temperature and 
a low-temperature model on the same lattice. The location of the critical point is then 
straightforward; one obtains for the triangular lattice (q = 6) 


1 


Kc = - sinfV 1 -4- ~ 0.2747, 

2 V3 


1 


( 20 ) 
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and for the honeycomb lattice (q — 3) 

Kc = * sinh -1 V3 - 0.6585. (21) 

The numerical values of K c , given by equations (19) through (21), reinforce the fact that 
higher coordination numbers help propagate long-range order in the system more effec¬ 
tively and hence raise the critical temperature T c . 

The partition function and the specific-heat singularity 

The partition function of the Ising model on a square lattice is given by, see references cited 
at the beginning of this section, 

71 / 2 

1 i r 

— lnQ(D = ln{2 1/2 cosh(210}+- / dcpln{l + V(1 - k 2 sin 2 cp)}, (22) 

N re J 

o 


where 


k = 2 sinh(2^T) / costr (2K) . (23) 

Differentiating (22) with respect to — ft, one obtains for the internal energy per spin 

lfU 0 (T) = - 2/tanh(2*0 + - ( k ^ 

N TC \ dp 


7r/2 

x J dtp 


sin 2 <p 

{1 + V(1 — k 2 sin 2 </>)}\/(l — k 2 s\v?<P) 


(24) 


Rationalizing the integrand, the integral in (24) can be written as 


1 f-f+*!«}, 


(25) 


where K\{k) is the complete elliptic integral of the first kind, k being the modulus of the 
integral: 


K\(k) = 


7r/2 

/ 


V (1 —k 2 sin 2 <p) 
u 

Now, a logarithmic differentiation of (23) with respect to ft gives 


(26) 


- ^ = 2/{coth(2kT) - 2tanh(2tsT)). 
k dft 


(27) 
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Substituting these results into (24), we obtain 



(28) 


where k' is the complementary modulus: 


k' = 2tanh 2 (2fO-1 (k 2 + k ' 2 = 1). 


(29) 


Figure 13.12 shows the variation of the moduli k and k' with the temperature parame¬ 
ter ( kT/]) = K~ l . We note that, while k is always positive, k' can be positive or negative; 
actually, k lies between 0 and 1 while k' lies between -1 and 1. At the critical point, where 
sinh(21Q) = 1 and hence K~ l ~ 2.269, the moduli k and k' are equal to 1 and 0, respectively. 

To determine the specific heat of the lattice, we differentiate (28) with respect to 
temperature. In doing so, we make use of the following results: 



(30) 


and 


cIKi(k) 1 
ch< k' 2 k 


{Ei(k) — k ,2 Ki(k)}, 


(31) 


where E\ (k) is the complete elliptic integral of the second kind: 



(32) 


o 


K, K 


K 


0 



> kTIJ 


-1 


FIGURE 13.12 Variation of the moduli k and k' with (kT/J). 
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We finally obtain 

^Co(T) = ?-{Kcoth(2K)} 2 [2{Ki(*) -Ei(k)} - (1-0 j | +k'Ki(k) j] . (33) 

Now, the elliptic integral K\ (at) has a singularity at k = 1 (i.e., at k' = 0), in the neighbor¬ 
hood of which 


K\(k) a* ln{4/|*r'|} and £i(>c)^l. 


(34) 


Accordingly, the specific heat of the lattice displays a logarithmic singularity at a tempera¬ 
ture T c , given by the condition: k c = 1 (or k' c — 0), which is identical to (18). In the vicinity 
of the critical point, equation (33) reduces to 

^Q,m*f*?[lnl4/l«'l)-(l+I)]; (35) 

at the same time, the parameter k’ reduces to, see equation (30), 

k' ~2J2K c (l-^r). (36) 


The specific heat singularity is, therefore, given by 


Nk CoiT ^n K ‘ 


-In 


~ -0.4945In 


T 

1 c 

T 

1- 

T c 


+ |l "(l 


+ const., 


UY)\ 


(37) 


signaling a logarithmic divergence at T = T c . 

Figures 13.13 and 13.14 show the temperature dependence of the internal energy and 
the specific heat of the square lattice, as given by the Onsager expressions (28) and (33); for 
comparison, the results of the Bragg-Williams approximation and of the Bethe approx¬ 
imation (with q = 4) are also included. The specific-heat singularity, given correctly by 
the Onsager expression (37), is seen as a (logarithmic) peak in Figure 13.14, which differs 
markedly from the jump discontinuity predicted by the mean field theory. We conclude 
that the critical exponent a — a' — O(log). 

In passing, we note that the internal energy of the lattice is continuous at the critical 
point, having a value of —J2.J per spin and an infinite, positive slope; needless to say, the 
continuity of the internal energy implies that the transition takes place without any latent 
heat. 


Other properties 

We now consider the temperature dependence of the order parameter (i.e., the sponta¬ 
neous magnetization), of the lattice. An exact expression for this quantity was first derived 
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kT/J 



FIGURE 13.13 The internal energy of a square lattice (q = 4) according to (1) the Onsager solution, (2) the Bethe 
approximation, and (3) the Bragg-Williams approximation. 



FIGURE 13.14 The specific heat of a square lattice (q = 4) according to (1) the Onsager solution, (2) the Bethe 
approximation, and (3) the Bragg-Williams approximation. 


by Onsager (1949), though he never published the details of his derivation. The first 
published derivation is due to Yang (1952), who showed that 


Lo(T) = T) = 

N/i 


| [l-{sinh(210r 4 ] 1/8 

to 


for 

for 


T <T C 
T > T c , 


(38a) 

(38b) 


where K, as usual, is J/kT. In the limit T 0, 

Zo (T) ~ 1 -2exp(-8 J/kT), (39) 

which implies a very slow variation with T. On the other hand, in the limit T -»• T c —, 

1/8 


~ l 2224 ( i ~y c 


Lo(T) 


1/8 


(40) 
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FIGURE 13.15 The spontaneous magnetization of a square lattice (q = 4) according to (1) the Onsager solution, 
(2) the Bethe approximation, and (3) the Bragg-Williams approximation. 


which indicates a very fast variation with T. The detailed dependence of L 0 on T is shown 
in Figure 13.15; for comparison, the results of the Bragg-Williams approximation and the 
Bethe approximation are also included. We infer that the critical exponent f) for this model 
is g, which is very different from the mean field value of 

Onsager also calculated the correlation length £ of the lattice, which showed that the 
critical exponent v = v' = 1 — in sharp contrast to the classical value of \. Finally, he set up 
calculations for the correlation function g(r) from which one could infer that the exponent 
q — again in disagreement with the classical value of zero. Precise asymptotic expres¬ 
sions for the correlation function in different regimes of temperature were derived by later 
authors (Fisher, 1959; Kadanoff, 1966a; Au-Yang and Perk, 1984): 


g(r) 


{4 (Kc-K)} 1 ' 4 _ r/e 

2 3 / 8 (7rr/f) 1 / 2 ’ 


£ = {4 (Kc-K)}- 1 


(41) 


for T > T c , 


g(r) 


WK-K C )}^ 2r/i 
2 21 / s n(r/%) 2 


£ = {4 (K-Kc)}- 1 


(42) 


for T < T c , and 


g(r) 


2 1 / 12 exp(3f'(—1)) 

ri A 


(43) 


at T — T c \ in the last expression, £'(jc) denotes the derivative of the Riemann zeta function. 
We note from expressions (41) and (42) that the correlation length §, while diverging at 
T — T c , is finite on both sides of the critical point. This feature is peculiar to the scalar 
model (n = 1) only, for in the case of vector models (n > 2), £ turns out to be infinite at 
all T < T c . 
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The zero-field susceptibility of this system has also been worked out (see Barouch et al., 
1973; Tracy and McCoy, 1973; Wu et al., 1976); asymptotically, one finds that 

_ Np 2 x f C+t~ 7/4 for t>0 (44a) 

kT c ic_|tr 7/4 for f<0, (44b) 

where t, as usual, is (T — T c )/T c while the constants C + and C_ are about 0.96258 and 
0.02554, respectively. We see that the critical exponent y = y' = as opposed to the mean 
field value of 1, and the ratio C+/C- ~ 37.69, as opposed to the mean field value of 2. 
Assembling all the exponents in one place, we have for the two-dimensional Ising model 

a = a' = 0(log), p =-, Y = Y r = ^> v = v’ = 1, )? = -. (45) 

Since this model has not yet been solved in the presence of a field, a direct evaluation 

of the exponent <5 has not been possible. Assuming the validity of the scaling relations, 
however, we can safely conclude that 5 = 15 — again very different from the classical value 
of 3. All in all, the results of this section tell us very clearly, and loudly, how inadequate the 
mean field theory can be. 

Before we close this section, a few remarks seem to be in order. First of all, it may be 
mentioned that for the model under consideration one can also calculate the interfacial 
tension s, which may be defined as the “free energy associated, per unit area, with the 
interfaces between the domains of up spins and those of down spins"; in our analogy with 
the gas-liquid systems, this corresponds to the conventional surface tension a. The corre¬ 
sponding exponent p , that determines the manner in which .s' —»■ 0 as T T c — , turns out to 
be 1 for this model; see Baxter (1982). This indeed obeys the scaling relation p = (d — l)v, 
as stated in Problem 12.26. Second, we would like to point out that, while the solution 
of the two-dimensional Ising model was the first exact treatment that exposed the inade¬ 
quacy of the mean field theory, it was also the first to disclose the underlying universality of 
the problem. As discovered by Onsager himself, if the spin-spin interactions were allowed 
to have different strengths, / and J\ in the horizontal and vertical directions of the lattice, 
the specific-heat divergence at T — T c continued to be logarithmic — independently of the 
ratio J’/J — even though the value of T c itself and of the various amplitudes appearing in 
the final expressions were modified. A similar result for the spontaneous magnetization 
was obtained by Chang (1952) who showed that, regardless of the value of /'//, the expo¬ 
nent continued to be g. Further corroborative evidence for universality came from the 
analysis of two-dimensional lattices other than the square one which, despite structural 
differences, led to the same critical exponents as the ones listed in equation (45). 

13.4.A The two-dimensional Ising model on a finite lattice 

Phase transitions, viewed as critical phenomena, cannot occur in a finite system since 
a statistical mechanical model with a finite number of degrees of freedom cannot have 
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a nonanalytic partition function or free energy. Criticality occurs only in the thermody¬ 
namic limit. Since real physical systems are of finite size, the manner in which finite- 
size effects manifest themselves as the correlation length § approaches the system size 
is of considerable importance in understanding how critical singularities get rounded 
off in real systems. In this regard, the two-dimensional nearest-neighbor Ising model 
on a square lattice in zero field can be solved on a finite square lattice with periodic 
boundary conditions (Kaufman, 1949), which allows for a detailed exploration of finite- 
size effects, especially near the bulk critical point; see Ferdinand and Fisher (1969). 
Kaufman’s solution is based on a determination of all the eigenvalues of the transfer 
matrix. 

Onsager (1944) only required the largest eigenvalue since his solution was based on 
a strip geometry with the length of one side taken to infinity. We here consider the Ising 
model on a lattice with n rows and in columns with periodic boundary conditions; see 
Figure 13.16. Each column of n spins has 2" possible configurations, so the transfer matrix 
P that couples nearest-neighbor columns is a 2" x 2” matrix of Boltzmann factors with 
eigenvalues k a , with a — 1,2,..., 2”. Just as in the case of the one-dimensional Ising model 
studied in Section 13.2, the partition function of a system with n rows and m columns can 
be written as the trace of a transfer matrix P: 

2 n 

Qn,m(K) = Trace (P" 1 ) = £*£*, (46) 

a—l 

where the eigenvalues of the transfer matrix fall into two classes: 

^ (2sinh(2K))" /2 £]exp (±yo ± Y2 ± ■■■ ± Y2n-i)^, 

(2sinh(2K))" /2 £]exp Q (±yi ± Y3 ± ■■■ ± Y2n-i)^j- 



FIGURE 13.16 A finite square lattice with n = 4 rows and m = 6 columns. In view of the periodic boundary 
conditions, sites on the leftmost column interact with sites on the rightmost column and the bottom row interacts 
with the top row. 
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The quantity y q for 0 < q < 2n is the positive root of the equation 


cosh(yg) = 


cosh“(2^0 

sinh(2K) 


— cos 


(?)■ 


while the q = 0 case is given by 


(48) 


e ro — e 2K tanh(K). (49) 

Only terms with an even number of minus signs inside the exponentials appear in the 
sums in equation (47), so the partition function can be written as 

Qn,m(K) = ^(2smh(2K)) nm ( 2 (Y ] + Y 2 + Y 3 + F 4 ), (50) 

where 



n —1 

( m \ \ 

, 2 Y2q+1 J J > 


Y\ 

= ]~[ ( 2 cosh ( 

(51a) 


q= 0 




n— 1 

• m \\ 

,2 Y2q+l ))' 


y 2 - 

= ]~[ ("2sinh ( 

(51b) 


q=0 




72—1 

( m \\ 
ci Y2 v)' 


Y 3 -~ 

= ]~| ^2 cosh | 

(51c) 


q= 0 




72—1 

m \\ 

,2 ~ Y2q )) ’ 


t 4 = 

= ]~[ (2sinh ( 

(5 Id) 


q= 0 




see Kaufman (1949). This form of the partition function allows for an exact calculation 
of the free energy, internal energy, and specific heat on finite lattices; see Figure 13.17. 
The logarithmic singularity in the specific heat at the bulk critical point evolves from a 
specific heat peak that grows logarithmically with the system size, that is C nm (K c )/nmk ~ 
(8K 2 /jz) ln(«) ~ 0.4945ln(n); see Ferdinand and Fisher (1969). Note also that the coeffi¬ 
cient of In (n) here is the same as the coefficient of the ln(|l — T/T c \) term in the bulk 
specific heat, as given in equation (37). 

The low-temperature series expansion for the partition function can be written as 
Qn,m(K) = e 2nmK Qn,m(K), where 


nm 

Qn,m(K) = y ] gq x ^t (52) 

q =0 

x = e~ 2K is the Boltzmann factor for a single excitation, and the coefficients g q denote 
the number of configurations with energy AqJ above the ground state. The sum of the 
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FIGURE 13.17 Specific heat of the two-dimensional Ising model for finite 2 x 2, 4 x 4,.... 64 x 64 lattices. The 
specific heat is analytic for all finite lattices. The maximum value of the specific heat grows proportional to the 
logarithm of the linear dimension of the lattice and the location of the maximum approaches the bulk critical 
temperature (denoted by the vertical line) proportional to the inverse of the linear dimension of the lattice. From 
Ferdinand and Fisher (1969). Reprinted with permission; copyright © 1969, American Physical Society. 


coefficients counts all the microstates in the system; therefore 


nm 


lim Q 

0 


n,m 


W = X> = 2 nm . 

q=0 


The coefficients g q represent the number of microstates pertaining to energy (—2 nmj + 
4 qj), with the corresponding entropy being lc\ng q . 

The first term in the series is go = 2 since there are two degenerate ground states, 
namely all spins up or all spins down. It is straightforward to see that only even orders in 
x appear in this expansion. Examples of the low-order graphs that contribute to the series 
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(a) (b) (c) (d) 


FIGURE 13.18 The lowest few excited states of the lattice. The q = 2 states (a), have a single down spin in a sea of 
up spins or a single up spin in a sea of down spins; these states have energy 8 \J above the ground state and there 
are g 2 = 2nm configurations. The <7 = 3 states (b), have a pair of down spins in a sea of up spins, or vice versa; these 
states have energy 12/ above the ground state and g 3 = 4 nm configurations. The q = 4 states (c) and (d), can have a 
single grouping of opposite spins, or a pair of isolated flipped spins; these states have energy 16/ above the ground 
state and the total number of configurations g 4 = (nm) 2 + Snm. 


are shown in Figure 13.18. The first few terms in the series are 

Qn,m(K) = 2+ (2nm)x 4 + ( 4nm)x 6 + [(nm) 2 + 9 nm\ x 8 

+ (4(nm) 2 + 24nm\x 10 + _ (53) 

If both n and m are even, the model’s ferromagnetic/antiferromagnetic symmetry 
(/ -/ and s t -* -Si on one sublattice) gives: g q — g nm -q ■ Due to the self-duality of 

the two-dimensional square lattice, exactly the same coefficients gq also appear in the 
high-temperature series expansion where the expansion variable is tanhfV 

The probability P q of finding an equilibrium state with energy 4 qj above the ground 
state is given by 


p = 

Qn,m(K) 

and the internal energy and the heat capacity per spin are given by 


jj ^ 1 v 

W = {N=nm) ’ 

q =0 



(54) 


(55a) 


(55b) 


One can cast Kaufman’s solution, equation (50) and equation (51), in the form of a 
low-temperature expansion of the form shown in (52), thereby giving an exact determi¬ 
nation of the partition function and the equilibrium energy distribution; see Beale (1996). 
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The low-temperature series (52) can be written as 

nm 

Qn,m(K) = = (Z\ + Zi +Z 3 + Z 4 ), 

q=0 

where if n is even, then 

n/ 2-1 

Zl = 2 11 C 2q+1’ 
q= 0 

n/ 2-1 

z 2 = 2 n s 2 fl +i> 

q =0 

: n/ 2—1 

23=-C 0 C„ f] 4,- 
17=1 

j n/2—1 

z 4 = -s 0 s n n 4?; 

<7=1 

while if n is odd, then 

x (n—3)/2 

Zl = -C n ]~[ C^ +1 , 

q=0 

l (n- 3)/2 

z 2=n s ^+i- 

q=0 

l (n- 1)/2 

^3 = -C 0 n C l<7’ 

<7=1 

x (n-l)/2 

z 4=2 s ° n 4?- 

<7=1 


The factors in equations (57) and (58) are 


co = (1 — x) m + (x(l + x)) m , 
s 0 = (i-x) m -(xa+x)Y n , 
c n = (1 + x) m + (x(l - x)) m , 

Sn = a+X) m -(X(1-X)) m , 

^ (2j)l(m-2j)\ I 
J / 


c 2 = —— 
^ 2 m— 1 


(56) 


(57a) 

(57b) 

(57c) 

(57d) 


(58a) 

(58b) 

(58c) 

(58d) 


(59a) 

(59b) 

(59c) 

(59d) 

(59e) 
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= 2x(l — x 2 ), 



(59f) 


(59g) 


(l+x 2 ) 2 -/Scos(^) 


(59h) 


The function [z] denotes the largest integer less than or equal to z. The quantities c 2 and .s' 2 
were expanded using the binomial series in order to explicitly remove all square roots that 
would hide the polynomial nature of the final result. A symbolic programming language 
can be used to numerically expand the partition function as a polynomial in the variable 
x in the form (52). One must set the numerical precision in the calculation to somewhat 
more than nm In 2/In 10 decimal digits in order to determine the exact values of the integer 
coefficients {g q }. The numerical calculation can be checked against the low-order result 
(53) or with an exact enumeration of energies on small lattices. 

The low-temperature series for the Ising model on a 32 x 32 lattice is 


Q32,32(10 = 2 + 2048x 4 + 4096x 6 + 1057792X 8 + 4218880X 10 


+ 371621888X 12 + 2191790080X 14 
+ 100903637504X 16 + 768629792768X 18 
+ 22748079183872X 20 + ■ ■ • + 4096x 2042 
+ 2048x 2044 + 2X 2048 , 


(60) 


where the largest coefficient is 


g 5 l2 = 6,342,873,169,001,916,568,766,443,273,025,000,331,593,063, 
924,436,135,196,680,443,689,656,478,072,741,300,511,612, 
123,900,652,711,596,311,283,701,724,071,226,144,241,851, 

411,641,714,893,727,789,741,510,169,213,344,005,116,385, 

197,594,692,089,556,614,547,788,150,860,200,720,413,211, 

442,412,355,672,291,841,364,265,145,274,980,444,405,423, 

129,672,679,584,959,498,234,944,801,613,246,300,853,599, 
317,229,362,316, (61) 


that is, there are about 6.342 x 10 306 configurations with energy halfway between the 
ferromagnetic and antiferromagnetic ordered states. This single microstate comprises 
3.5 percent of the 2 1024 total configurations of the model. The exact results for the micro- 
canonical entropy and the energy distribution for the 128 x 128 lattice are shown in 
Figures 13.19 and 13.20. These results provide excellent tests of Monte Carlo simulation 
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methods, including broad histogram methods; see Beale (1996), Wang and Landau (2001), 
and Landau and Binder (2009). 2 



q/nm 


FIGURE 13.19 Microcanonical entropy per spin S/Nk = In (g q )/nm for the two dimensional Ising model on a 
128 x 128 lattice as calculated from equations (56), (57), and (59). The slope of the curve is proportional to the 
inverse temperature, so the state with q/nm = 1/2 represents the infinite temperature state with energy halfway 
between the ordered ferromagnetic and antiferromagnetic states; the largest coefficient gsi92 — 1.049 x to 4930 is 
the number of configurations with t/ = nm/2 = 8192. Likewise, the states at q = 0 and q = mn represent the 
ferromagnetic and antiferromagnetic ground states, so the slopes of the curve diverge logarithmically in the 
thermodynamic limit. 



FIGURE 13.20 The exact energy distribution P q for the two-dimensional Ising model on a 128 x 128 lattice for 
K = 0.4, K = K c ~ 0.4407, and K = 0.5. The variance of the energy distribution is proportional to the specific heat, so 
is largest near K = K C . Refer to Figure 13.17. 

2 Mathematica code for calculating the low-temperature series coefficients for a two-dimensional Ising model, as 
well as the microcanonical entropies, internal energies, and specific heats for several lattice sizes can be found at 
www.elsevierdirect.com. 
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13.5 The spherical model in arbitrary dimensions 

In the wake of Onsager’s solution to the two-dimensional Ising problem in zero field, sev¬ 
eral attempts were made to go beyond Onsager — by solving either the three-dimensional 
problem in zero field or the two-dimensional problem with field. None of these attempts 
succeeded; the best one could accomplish was to rederive the Onsager solution by newer 
means. This led to the suggestion that one may instead consider certain “adaptations” 
of the Ising model, which may turn out to be mathematically tractable in more than two 
dimensions and hopefully throw some light on the problem of phase transitions in more 
realistic situations (where d is usually 3). One such adaptation was devised by Kac who, in 
1947, considered a model in which the spin variable eq, instead of being restricted to the 
discrete choices —1 or +1, could vary continuously, from — oo to +oo, subject to a Gaussian 
probability distribution law, 


p(ot)doi - ( A/n) l/2 e Aa i do, (i = 1,... ,1V), (1) 

so that of, on an average, = 1/(2 A). Clearly, for conformity with the standard practice, 
namely of — 1, the constants here should be equal to we may, however, leave it arbi¬ 
trary for the time being. The resulting model is generally referred to as the Gaussian model, 
and its partition function in the presence of the field is given by the multiple integral 


+oo +oo 


Qn = 


-/-/(#) 


A \ N /2 ~ A Y. a f+ K Y. a i a j+ h Y.°i 


|"~[ da I (K = PJ,h = PfiB). 


( 2 ) 


The exponent in the integrand is a symmetric, quadratic function in the eq; using standard 
techniques, it can be diagonalized. Integrations over the (transformed) oj can then be car¬ 
ried out straightforwardly — and in any number of dimensions; for details, see Berlin and 
Kac (1952) or Baker (1990). 

One finds that for d > 2 the Gaussian model undergoes a phase transition at a finite 
temperature T c which, for a simple hypercubic lattice, is determined by the condition 
K c =A/d; note that, with A = \, this result is precisely the one predicted by the mean field 
theory (with q — 2d). There are differences, though. First of all, the present model does not 
exhibit a phase transition at a finite temperature for d <2. Secondly, the critical exponents 
for 2 < d < 4 are nonclassical, in the sense that some of them are d-dependent, though 
for d > 4 they do become classical. More importantly, at temperatures below T c , where K 
exceeds A/d, the integral in (2) diverges and the model breaks down! This led Kac to aban¬ 
don this model and invent a new one in which the spins were again continuous variables 
but subject to an overall constraint, 


J2of=N, 


(3) 
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rather than to individual constraints, of = 1 for each i, or to an arbitrary probability 
distribution law. Constraint (3) allows individual spins to vary over a rather wide range, 
—IV 1 / 2 to +/V 1 / 2 , but restricts the super spin vector { 07 } to the “surface of anlV-dimensional 
hypersphere of radius IV 1 / 2 ”; in the Ising model, the same vector is restricted to the “cor¬ 
ners of a hypercube inscribed within the above hypersphere.” The resulting model is 
generally referred to as the spherical model. 

Constraint (3) can be taken care of by inserting an appropriate delta function in the 
integrand of the partition function. Using the representation 


"-£”?)-sj 


x+zoo 

/ 


e > dz, 


(4) 


the partition function of the spherical model is given by 


X+lOO 


+OO +OO 


Qn = 

2tti 


/ dze “f”J 


-zY,crf+K £ cricrj+hJ^Vi _ 

e <■ nn - '■ n^- 


(5) 


For a fixed z, the integral over the 07 can be carried out in the same manner as in the Gaus¬ 
sian model; see equation (2). Let the result of that calculation be denoted by the symbol 
Zn(K, h;z). The partition function Q,y of the spherical model is then given by the complex 
integral 


Qn = 

2jt 


hi 


dze zN Zn(KJv,z), 


( 6 ) 


which can be evaluated by the saddle-point method — also known as the method of steep¬ 
est descent, see Section 3.2. One finds that the saddle point of the integrand in ( 6 ) lies at the 
point z — Xq, where x 0 is determined by the condition 


— {zN + InZjv {K, h,z)} 
dz 


z=x 0 


= 0 , 


with the result that, asymptotically, 


(7) 


^ In Q n ^x 0 + ^-j\nZ N (K,h;x 0 ). (8) 

The thermodynamic properties of the system can then be worked out in detail. 

It turned out that many physicists felt uncomfortable at the necessity of using the 
method of steepest descent, so a search for an alternative approach seemed desirable. In 
this connection Lewis and Wannier (1952) pointed out that while the ensemble underlying 
the model of Berlin and Kac was canonical in the variable E it was microcanonical in the 
variable They proposed that one consider instead an ensemble that is canonical in 
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both E and the method of steepest descent could then be avoided. All one requires 

now is that the constraint (3) be obeyed only in the sense of an ensemble average, 


fa)-* 

(9) 

rather than in the original sense that was comparatively more rigid. The resulting model is 
referred to as the mean spherical model. 

Constraint (9) can easily be taken care of by modifying the Hamiltonian of the system 
by including a term proportional to , that is, by writing 

H = -/yX aid] - nB^di + X^ a i> 

n.n. i i 

(10) 

where X is the so-called spherical field, and requiring that 


(( dH/dX))=N . 

( 11 ) 

The partition function of the revised model is thus given by 


+00 +00 o 

r r -AI£ c r i oj+/ 2 y> i 

Qn= j - J e t <■ Y]doi 

—oo —oo ^ 

( 12 a) 

= Zn(K, h ; px), 

( 12 b) 


with the constraint 


1 3lngy(^MM 

p ax 

Comparing (13) with (7), we readily see that the parameter xq of the spherical model is 
precisely equal to the parameter px of the mean spherical model. The free energy result¬ 
ing from ( 12 ), however, differs a little from the one given by equation ( 8 ), which is not 
surprising because the transition from a model that was microcanonical in the variable 
SC 2 (= YLi °f) 1 ° one that is canonical modifies the nature of the free energy — it goes from 
“being at constant S*” to “being at constant X.” The two free energies are connected by the 
Legendre transformation 


A x = A s > + X(S" 2 ) =A_y + XN t 


(14) 


so that 


—A y = —Ax — X. 
N ,y N 


(15) 


This is precisely the difference that arises from the use of expression ( 8 ) or expression (12). 
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We now proceed to examine the thermodynamic properties of the mean spherical 
model, especially the nature of its critical behavior in arbitrary dimensions. The impor¬ 
tance of these results will be discussed toward the end of this section. 


The thermodynamic functions 

We consider a simple hypercubic lattice, of dimensions L\ x • • • x L^, subject to periodic 
boundary conditions. The partition function of the system, as given by equation (12a), 
then turns out to be (see Joyce, 1972; Barber and Fisher, 1973) 


Zn(K, h ; f)X) 


n 

k 


n 1/2 


_P(X-fi k )_ 


e Ar/! 2 /4/J(A.-Mo) ) 


(16) 


where /i/ c are the eigenvalues of the problem, 

2:m; 

/xfc=/Vcos(fc/a), kj — —d {fij = 0,1,..., (Nj — 1)}, (17a) 

1=1 j 

Nj = Lj/a, N=Y\Nj, (17b) 


i 

and a the lattice constant of the system. The free energy is then given by 

Ax = J-y'ln fi(X ~^ k) - N ^ B2 , 

2/?^ n 4a-/r 0 ) 

while the parameter X is determined by the constraint equation, see equation (13), 


(18) 


1 ^ 1 Nfi 2 B 2 

— > -+- - - T =1V. (19) 

fc ~ _ ^o ) 2 

The magnetization M and the low-field susceptibility /o follow readily from equa¬ 
tion (18): 3 


M=^E, 

2a — mo) 


N p 2 

2 a — mo) 


(20a, b) 


Introducing the variable m(= M/Nu), the constraint equation (19) may be written in the 
form 


E 


i 


(7 - Mfc) 


= 2Npa-m z ). 


( 21 ) 


3 Note, however, that to calculate the field-dependent susceptibility, (SM/3B)j- subject to the spherical constraint 
(19), one must keep in mind the field dependence of X while differentiating (20a). 
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Next, the entropy of the system in zero field is given by 4 

S 0 = -0A/97V fcl B=o = ^fcB^[l-ln{ j 6(A- M )}] (22) 

k 


and the corresponding specific heat by 


C 0 = T 


3So\ 

Iff ) 




l T(dX/dT)ol 

= N 

r 1 *. 

(8X\ I 

(X - Mfc) . 

2 

V3?7oJ 


(23) 


here, use has been made of equation (19), with 5 = 0. 

To make further progress we need to determine X, as a function of B and T, from the 
constraint equation (19). But first note, from equation (18), that for the free energy of the 
system to be well-behaved, X must be larger than the largest eigenvalue p 0 — which, by 
(17a), is equal to Jd. At the same time, equation (20b) tells us that the singularity of the 
problem presumably lies at X = po- We may thus infer that, as T decreases from higher 
values downward, X also decreases and eventually reaches its lowest possible value, po, 
at some critical temperature, T c , where the system undergoes a phase transition. The 
condition for criticality, therefore, is 


Xc — po — Jd, (24) 

which suggests that we may introduce a “reduced held,” </>, by the definition 


4> = (X-Xc)/J=(X/J)-d; 


(25) 


the condition for criticality then becomes 


<Pc = o. (26) 

It follows that, as we approach the critical point from above, the parameter (p becomes 
much smaller than unity; ultimately, it becomes zero as T c is reached and stays so for all 
T < T c . 

Now, substituting for the eigenvalues p^ into the sum appearing in equations (19) and 
(21), and making use of the representation 

OO 

- = J e~ zx dx, (27) 

o 


l We denote Boltzmann’s constant by the symbol kg here so as to avoid confusion with the wavenumber k. 
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we have 



For Nj 1, the summation over rij may be replaced by integration; writing Oj = 2nrij/Nj, 
one gets 


2n 


£exp{---}» f 

n ■ J 


e ~x+xcos0j * L de . = Nj e~ x Io (x), 


where Iq(x ) is a modified Bessel function. Multiplying over j, one finally gets 

^ 1 N 

Ei—= T w dm, 

k x ~ llk 1 

where 1V^(0) is the so-called Watson function, defined by 5 

OO 

W d (<$>) = J e-^ie^IoW^dx. 

0 

Equations (19) and (21) now take the form 

W d (<j>) = 2K- ( P' lB) 
d 2K<t > 2 

= 2K(1 — m 2 ). 


(29) 


(30) 


(31) 


(32a) 

(32b) 


The asymptotic behavior of the functions W d ((p), for (/; <<c 1, is examined in Appendix G. 


The critical behavior 

We now analyze the various physical properties of the mean spherical model in different 
regimes of d. 

(a) d < 2. For this regime we take expression (7 a) of Appendix G and substitute it into 
equation (32a), with B — 0. We obtain 


T[(2-d)/2}l 2/(2 ~ d) rk B T\ 2/i2 ~ d) 
_ 2(2n) d / 2 K \ ~ V~/ 


(33) 


5 Note that our definition of the function W d (<t>) differs slightly from the one adopted by Barber and Fisher (1973); this 
difference arises from the fact that our J is twice, and our 0 is one-half, of theirs. 
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We see that </> in this case goes to zero only as T -*■ 0. The phase transition, therefore, 
takes place at T c — 0. Equations (20b), (23), (24), and (25) then give the following for 
the low-temperature susceptibility 


Xo = 


Nfi 2 


Nfi 2 / k B Ty 2l(2 ~ d) 


and for the low-temperature specific heat 


Co - -Nk B = -NJ 



—Nk B 


k B T 

I 


d/(2—d) 


(34) 


(35) 


(b) d — 2. We now use expression (7b) of Appendix G and obtain 


0Ib=o ~ exp(-4nJ/k B T), (36) 

so that once again T c = 0 but now, at low temperatures, 

Xo ~ (Nil 2 /J)exp(4nJ/k B T) (37) 


and 


C 0 - ^Nk B - Nk B (J!k B T) 2 exp(—4 jt// k B T). 


(38) 


(C) 2 < d < 4. We now substitute expression (7c) of Appendix G into equation (32a), with 
the result 


W d ( 0 ) 


\r{(2-d)/2}\ (d _ 2)/2 (PiiB) 2 

(2n) d / 2 9 ^ 2Kct> 2 ' 


(39) 


The critical point is now determined by setting B — 0 and letting </>-»• 0; the condition 
for criticality then turns out to be 


K c =-W d {0). 


(40) 


The variation of <j> with T as one approaches the critical point is given by 


0Ib=o : 


2(2n) d ' 2 (K c -K) 


2/(d—2) 


(K<K C ). 


(41a) 


\T((2-d)/2}\ 

We also note that once <p becomes zero it stays so for all temperatures below, that is, 

0Ib=o = 0 (K > K c ). (41b) 


It then follows that 


~ (Kc - Kr 2/(d ~ 2) ~(T- T c r 2Kd ~ 2) (T>Tc) 


Xo 


(42) 
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and is infinite for T < T c . At the same time 

C 0 - l -Nk H ~(T-T c ) (4 - d ^ <d - 2) (T>T C ) (43) 

and it vanishes for T < T c . The spontaneous magnetization is determined by 
equations (32b), (40), and (41b); we obtain a remarkably simple result 

m 0 = a-K c /K) 112 = (1 - T/T c ) 1/2 (T < T c ). (44) 

Finally, if in equation (39) we retain B but set T — T c , we get 

4>c ~ g 4 /(d+ 2 ) (T=T c y, (45) 


equation (20a) then gives 


m c = 


fiB 
2 Wc 


B 


id-2)/(d+2) 


(T — T c ). 


(46) 


The foregoing results give, for the critical exponents of the system, 


d-4 o _l _ 2 

d-2’ l '~2’ Y ~ d-2’ 


d + 2 

S=~— (2 < d < 4). 

d — 2 


(47) 


(d) d > 4. In this regime we employ expression (8) of Appendix G. The condition for 
criticality remains the same as in (40); the variation of </> with T as we approach the 
critical point is, however, different. We now have, for all d > 4, 

4>\b=o ~ (K c — K) 1 (K<K C ). (48) 


The subsequent results are modified accordingly: 

X0 ~CT—r c )-\ Co-^Nk B ~(T-T c )° ( T>T C ) (49) 

4>c~B 2 ' 3 , nic-B 1 ' 3 ( T=T C ). (50) 


Equations (41b) and (44) continue to apply as such. We, therefore, conclude that 

Qf = 0, P = \’ r = 1 ’ s = 3 (d>4). (51) 

(e) d — 4. For this borderline case, we use expression (12) of Appendix G. Once again, the 
condition for criticality remains the same; however, the variation of <p with T as one 
approaches the critical point is now determined by the implicit relation 


0ln(l/</>) « 8tt 2 (Kc — K) (K<K C ). 


(52) 
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Introducing the conventional parameter 


t=(T-T c )/T c = (K c -K)/K, 

we get, to leading order in t, 

<P\b=o ~ f/ln(l/f) (0<t«;l). 

It follows that 

Xo~r 1 ln(l/0, Co ~ 1/lnU/f) (0 < f« 1) 

4> c ~ B 2/3 /{ln(l/B)} 1/3 , m c ~ |_Bln(l/B)}h 3 (f = 0). 


(53) 


(54) 


(55) 

(56) 


Spin-spin correlations 

Following the procedure that led to equations (16) through (19), we obtain in the absence 
of the field 


G(r,r') =a(r)a(r') = 


1 ^ exp {i(k-R)} 

2 Nf) “ X - Hk 


(R = r-r); 


(57) 


compare to equation (19), with B = 0. The summation over k in (57) can be handled in the 
same manner as was done in (28); however, the resulting summation over nj now turns out 
to be 


2n 

/ AT. 

exp{ iRjdj/ a}e~ x+xcosB l -Tddj 

"i o 

= Nje~ x I Rj /a(xy, (58) 

compare to (29). This leads to the result 

OO 

C( R ) = 2^ / e“ 0A ' n [e- x I Rj/a W]dx- (59) 

0 i 

compare to equation (32a), with B— 0. For the functions l n (x) we may use the asymptotic 
expression (see Singh and Pathria, 1985a) 


gX— n 2 /2x 

hi(X) = 


V (2 nx) 


(x»l) 


(60) 
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so that, for<£ <*c 1, 


G(R) «- 1 —— [ e -^ x - R2/(2a2x) x- d ^dx 

2{2n ) d / 2 KJ 
0 

1 (a 2 \ (d ~ 2)/2 (R\ 

= j2ny^K\iR) (61) 

where K tl (x) is the other modified Bessel function while 

f = «/(20) 1/2 . (62) 

For R»(, we may use the asymptotic result (x) ~ {it/2x) l ^ 2 e~ x ‘, equation (61) then 

becomes 


G(fl) 


„d-2 


(d-3)/2 (2 ttR) ( d ~ 1 V 2 


-«/{ 


which identifies £ as the correlation length of the system. 

Now, comparing equation (62) with equation (20b), we find that 


(63) 


Xo = 


Nfi 2 

W 



(64) 


In view of the fact that £ ~ y ( J /2 , we infer that, in all regimes of d, the exponent v = \y ar| d 
hence, by relations (12.12.10) and (12.12.11), the exponent r] = 0. To obtain this last result 
directly from equation (61), we observe that, as T -»• T c from above, the parameter <j>-*■ 0 
and hence £ oo. We may then use the approximation f?/§ 1 and employ the formula 


^ (m > 0 ), 


(65) 


to obtain 


G(R)\t=t c 


T{{d — 2)/2) a d ~ 2 
4n d i 2 K c R d ~ 2 


( 66 ) 


The power of R appearing here clearly shows that r] — 0. Finally, substituting equation (41a) 
into equation (62), we get 




1 

2 a 


\r{(2-d)/2}\ 

4n d / 2 (K c -K)_ 


l/(d—2) 


(K<Kc), 


(67) 


which shows that for 2 < d < 4 the critical exponent v = l/(d — 2). 

For T <T C we expect the function G(i?) to affirm the presence of long-range order in 
the system, that is, in the limit R-* oo, it should tend to a limit, a 2 , that is nonzero. To 











518 Chapter 13 • Phase Transitions: Exact (or Almost Exact) Results for Various Models 


demonstrate this property of G(R), we need to take a closer look at the derivations of this 
subsection that were carried out with the express purpose of obtaining results valid in the 
thermodynamic limit (N -»• oo). This resulted in “errors” that were negligible in the region 
T>T C but are not so when T < T c . The first such error crept in when we replaced the sum¬ 
mations over {tij} in equation (28) by integrations; that suppressed contribution from the 
term with n = 0. Equation (30), therefore, accounts for only the (1c ± 0)-terms of the orig¬ 
inal sum in (28), and the missing term, 1 /J<p, may be added to it ad hoc. 6 Equation (19), 
with B = 0, then becomes 


1 

2 ? 


IN 

— + Y W d( r t > ) 
J<p J 


= N. 


( 68 ) 


Now, when </> becomes very very small, Wa(<j>) may be approximated by W^( 0), which is 
precisely equal to 2 K c ; equation (68) then gives 

miK-Kc)]- 1 (K > K c ), (69) 


rather than zero! The correlation length then turns out to be 

^=a/(2(j» l l 2 ^a[N(K-K c )] l l 2 (K > K c ), (70) 


rather than infinite! Now, the same error was committed once again in going from 
equation (57) to (59); so, the primary result for G(R), as given in equation (61), may be 
similarly amended by adding the missing term \/(2Nflj<p) which, by (69), is exactly equal 
to (1 - Kc/K). Thus, for R <K §, we obtain, instead of (66), 


G(R) 


( K c \ r{(d-2)/2} a d ~ 2 
V Kj 4t r d / 2 K R d ~ 2 


(K > K c ). 


(71) 


Now if we let R—^oo, G(R) does approach a nonzero value a 2 , which is precisely the same 
as given by equation (44). It is remarkable, though, that in the present derivation the 
magnetic field B has not been introduced at any stage of the calculation, which under¬ 
scores the fundamental role played by correlations in bringing about long-range order in 
the system. 

In the preceding paragraph we outlined the essential argument that led to the desired 
expression, (71), for G(R). For a more rigorous analysis of this problem, see Singh and 
Pathria (1985b, 1987a). 


Physical significance of the spherical model 

With a constraint as relaxed as in equation (3), or even more so in (9), one wonders how 
meaningful the spherical model is from a physical point of view. Relief comes from the 
fact, first established by Stanley (1968,1969a,b), that the spherical model provides a correct 

6 This is reminiscent of a similar problem, and a similar ad hoc solution, encountered in the study of Bose-Einstein 
condensation in Section 7.1; see also Section 13.6. 
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representation of the (n -»• oo)-limit of an n-vector model with nearest-neighbor interac¬ 
tions; see also Kac and Thompson (1971). This connection arises from the very nature of 
the constraint imposed on the model, which introduces a super spin vector S* with N 
degrees of freedom; it is not surprising that, in the limit N -*■ oo, the model in some sense 
acquires the same sort of freedom that an n-vector model has in the limit n -*■ oo. 

In any case, this connection brings the spherical model in line with, and actually makes 
it a good guide for, all models with continuous symmetry, namely the ones with n > 2. And 
since it can be solved exactly in arbitrary dimensions, this model gives us some idea as to 
what to expect of models for which n is finite. For instance, we have seen that, for d > 4, 
the critical exponents of the spherical model are the same as the ones obtained from the 
mean field theory. Now, fluctuations are neglected in the mean field theory but, among the 
variety of models we are considering, fluctuations should be largest in the spherical model, 
for it has the largest number of degrees of freedom. If, for d > 4, fluctuations turn out to be 
negligible in the spherical model, they would be even more so in models with finite n. It 
thus follows that, regardless of the actual value of n, mean held theory should be valid for 
all these models when d > 4. See, in this connection, Section 14.4 as well. 

For d < 4, the final results depend significantly on n. The spherical model now provides 
a starting point from which one may carry out the so-called (1 /n)-expansions to determine 
how models with finite n might behave in this regime. Such an approach was initiated 
by Abe and collaborators (1972, 1973) and independently by Ma (1973); for a detailed 
account of this approach, along with the results following from it, see Ma (1976c). Finally, 
for a comprehensive discussion of the spherical model, including the one with long-range 
interactions, see the review article by Joyce (1972). 


13.6 The ideal Bose gas in arbitrary dimensions 

In this section we propose to examine the problem of Bose-Einstein condensation in an 
ideal Bose gas in arbitrary dimensions. As was first shown by Gunton and Buckingham 
(1968), the phenomenon of Bose-Einstein condensation falls in the same universality class 
as the phase transition in the spherical model; accordingly, the ideal Bose gas too corre¬ 
sponds to the (n -»■ oo)-limit of an n-vector model. It must, however, be borne in mind 
that liquid He 4 , whose transition from a normal to a superfluid state is often regarded as a 
manifestation of the “Bose-Einstein condensation in an interacting Bose liquid,” actually 
pertains to the case n = 2. Now, just as the spherical model turns out to be a good guide 
for all models with continuous symmetry including the XY model (for which n = 2), in the 
same way the ideal Bose gas has also been a good guide for liquid He 4 . 

We consider a Bose gas composed of N noninteracting particles confined to a box of 
volume V(= Li x ■ ■ • x La) at temperature T. Following the procedure of Section 7.1, we 
obtain for the pressure P of the gas 

knT \ -\ , _o knT 

P =-^E 111 * 1 - ^ > = -Tjf8(d+2,I2(Z), 

8 


( 1 ) 
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where X [— hj-J (2n rnksT)} is the mean thermal wavelength of the particles, z is the fugacity 
of the gas, which is related to the chemical potential p through the formula 

z = exp(Pp) <1 (P= l/fcflT), (2) 

while g v (z) are Bose-Einstein functions whose main properties are discussed in Appen¬ 
dix D. The quantity z is determined by the equation 

N=Y J (z~ le>iE -l) -1 = N 0 +N e , (3) 


where Nq is the mean number of particles in the ground state (e = 0), 

N 0 =z/(l-z), (4) 

while N e is the mean number of particles in the excited states (e > 0): 

Ne = ^gd/2(z)- (5) 

At high temperatures, where z is significantly below the limiting value 1, Mo is negligible 
in comparison with IV; the quantity z is then determined by the simplified equation 

N=^gd/ 2<X>, (6) 

and the pressure P in turn is given by the expression 

p = Nk B T gjd+ 2)/2<X> m 

v gd/2(Z) 

The internal energy of the gas may be obtained from the relationship 

U = ^d(PV); (8) 


see the corresponding derivation of equation (7.1.12) as well as of equation (6.4.4). 

Now, making use of the recurrence relation (D.10) and remembering that the mean 
thermal wavelength X cx T -1 / 2 , we get from equation (6) 


l/9z\ d gd/ 2 (z) f _ V\ 

z\dTj v ~ 2T g(d-2)/2(z) \ V ~NJ’ 


0 ) 


and from equation (1) 


1/9z\ _ d+2 g{d+ 2 )/ 2 (z) 

Z\dTj p ~ 2 T gd/2(Z) 
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It is now straightforward to show that the specific heats Cy and Cp of the gas are given by 
the formulae 


Cy _ d(d + 2) g (d+2) /2(z) d 2 g d/ 2(z) 
Nk B ~ 4 gd/2(z) 4 g(d-2)/2(Z) 


and 


Cp _ (d+2) 2 {g(d+2)/2(Z)} 2 g(d-2)/2(z) d(d + 2) g { d+2)/2(Z) 

Nk B 4 {gd/2(Z)} 3 4 gd/2(Z) 

respectively; it follows that the ratio 

Cp _ (d + 2) g(d+2)/2(Z)g(d-2)/2(Z) 

Cy ~ d {gd/2(z )} 2 

The isothermal compressibility up and the adiabatic compressibility turn out to be 


1 / dv\ _ 1 (dv/dz)r _ g(d.+2)l2(z)g(d-2)/2(z) 1 

KT V9fi/ T - ~V(dP/dz)T ~ {gd,2(Z )} 2 P 

and 


(14) 


= 1 ( dv/3T) z _ d 1 

KS v\dp) s v(dP/dT) z d + 2 P’ 

respectively; note that the ratio kt/ks is precisely equal to the ratio Cp/Cy, as is expected 
thermodynamically. 

As the temperature of the gas is reduced, keeping v constant, the fugacity z increases 
and ultimately approaches its limiting value 1 — marking the end of the regime where N 0 
was negligible in comparison with N. Whether this limit is reached at a finite T or at T = 0 
depends entirely on the value of d ; see equation (6), which tells us that if the function 
gd /2 (z) i as z 1—, is bounded then the limit in question will be reached at a finite T. On 
the other hand, if grf /2 (z) , as z -*■ 1—, is unbounded then the desired limit will be reached at 
T = 0 instead. To settle this question, we refer to equations (D.9) and (D.ll), which 
summarize the behavior of the function g v (z) as z -*■ 1- (or as a -* 0+, where a — — lnz); 
we thus have 


gd/2(e “) 


const. 


ln(l/a) + -a 


2-d 

2 


(d—2)/2 


f(2)-{ln(l/a) + l}G( 

diYd*- 2 ' 


for 

d <2 

(16a) 

for 

d = 2 

(16b) 

for 

2 < d < 4 

(16c) 

for 

d = 4 

(16d) 

for 

d > 4, 

(16e) 
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£(v) being the Riemann zeta function. Similarity of this system with the spherical model 
is quite transparent. 

We readily see from equation (6) that, for d > 2, a -*■ 0 at a finite temperature T c , 
given by 

) d c =vt;(d/2), (17) 


with the result that 


T r . — 


h 2 


1 


-, 2 /d 


2 nmkB \_vt;(d/2)_ 


(18) 


for d < 2, a -»• 0 only as X -> oo, so T c = 0. For brevity, we confine our further discussion 
only to d > 2. 


The critical behavior 

As T approaches T c from above, the manner in which « -> 0 is determined by substitut¬ 
ing the appropriate expression (16) into (6) and utilizing the criticality condition (17). For 
2 < d < 4, one gets asymptotically 


r 


2 -d 


a ( d - 2)/2 






(19) 


For T > T c , this gives 


«~f2/f rf -2) [f=(r-r r )/r c ,o<t«i]. 


( 20 ) 


As T -> T c , the specific heat Cp and the isothermal compressibility kj diverge because the 
function g { d-2)/2(z) appearing in equations (12) and (14), being~a -(4-d)/2 [see equation 
(D.9)], becomes divergent; for small t, 

c P ~K T ~ r l4_dV(d “ 2) . ( 2 i) 


The specific heat Cy, on the other hand, approaches a finite value, 

/ Cy \ _ d(d + 2) ^{(d + 2)/2j 

\Nks) t^t c + 4 Hd/2) 

with a derivative that, depending on the actual value of d, might diverge: 


J_/9Cv\ _1 d 2 (d + 2) g ( d+ 2)/2 (z) d 2 g d / 2 (z) 
Nk B \dT ) v ~ T 8 gd /2 (z) 4 g ( d— 2 y 2 (X> 

d 3 {gd/2(^)) 2 g(d-4)/2(^) 

8 lg(d-2)/2(2l) 3 

_ a -(d- 3 )_ t — 2 (d— 3 )/(d— 2 ) 


( 22 ) 


(23a) 


(3 < d < 4). 


(23b) 
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Equating the exponent appearing here with (1 +a), we conclude' that the critical expo¬ 
nent a for this system is (d — 4) / (d — 2); compare to equation (13.5.47). For a proper 
appreciation of the critical behavior of Cy, we must as well examine the region T < T c , 
along with the limit T -»• T c —. 

ForT < T c , the fugacity z is essentially equal to 1; equations (5) and (17) then give 


AT V (d 

N '=tAi 




(24) 


It follows that 


N 0 = N-N e =N 




Equation (4) then tells us that the precise value of z in this region is given by 


z = Nq/(Nq + 1) — 1 — 1/iVo, 


(25) 


(26) 


which gives 


o' = — lnz~ 1/ATo, 

rather than zero. Disregarding this subtlety, equation (1) gives 


k B T ( d 4- 2 




j'(d+ 2)/2 


(27) 


(28) 


Since P here is a function of T only, the quantities k j and Cp in this region are infinite; see, 
however, Problem 13.26. From equations (8) and (28), we get 


a- 1 - d d+2 ' 

2 X d ' 


/ d -(- 2 \ 
A 


(29) 


which gives 


C v _d(d + 2) v Jd + 2\ _ d(d + 2) f((d + 2)/2j / T\ d/2 
Nk B ~ 4 X d( { 2 )~ 4 ad/2) VrJ 

As T ->• T c -, we obtain precisely the same limit as in (22) — showing that Cy is continuous 
at the critical point. Its derivative, however, turns out to be different from the one given in 

7 We equate this exponent with (1 + a ) because if Cv ~ t~ a , then dCv/St ~ We hasten to add that the critical 

exponent a should not be confused with the physical quantity denoted by the same symbol, namely a(— —lnz), which 
was introduced just before equations (16) and has been used throughout this section. 
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(23), for now 


1 

Nks 



1 d 2 (d + 2) f ((d + 2)/2) (d—2)/2 

T c 8 ?(d/2) 

1 d 2 (d + 2) f{(d + 2)/2) 

% 8 f(d/2) 



(31b) 


(31a) 


as T T c -. 


As for the condensate fraction, A/q/AT, equation (25) gives 


A/o 

At 



T\ rf/2 d 

T ) *-\t\ [f<0,|f|«l]. 


(32) 


Now, the order parameter in the present problem is a complex number, 'To, such that 
|'T 0 | 2 = Nq/V, the condensate particle density in the system; see Gunton and Buckingham 
(1968). We therefore, expect that, for \ t\ «; 1, N 0 would be ~ t 2 P; equation (32) then tells us 
that the critical exponent p in this case has the classical value 1 for all d> 2. To determine 
the exponents y and <5, we must introduce a “complex Bose field, conjugate to the order 
parameter 4' 0 ” and examine quantities such as the “Bose susceptibility” / as well as the 
variation of 'To with the Bose field at T =T C . Proceeding that way, one obtains: y = 2/{d - 
2) and S — (d + 2)/{d — 2), just as for the spherical model. 

The pair correlations 

We now examine the pair correlation function of the ideal Bose gas 


k 


(33) 


ga+^E(fe) _ l ' 


As usual, we replace the summation over Ic by integration (mindful of the fact that this 
replacement suppresses the (k = 0)-term which may, therefore, be kept aside). Making use 
of equation (C.ll) in Appendix C, we get 




TD i _L V fl -A-^« 2 /A 2 t-d/2 
V + k d2^ e J 

t=i 



(34) 


compare to equations (3) through (5), which pertain to the case R — 0. For R> 0, one may 
extend the summation over j from j = 0 to j = oo, for the term so added is identically zero. 
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At the same time, the summation over j may be replaced by integration — committing 
errors 0(e ~ R / A ), which are negligible so long as R yx /,; for details, see Zasada and Pathria 
(1976). We thus obtain 


G(R) = + p / e-J a - nR2/ ^ 2 j- d/2 dj 

0 

n 0 . 2 m 

V + xH2nl;R)(‘l-W K(d - 2)l2 U/ 

where (x) is a modihed Bessel function while 


f = X/(2n 1/2 a 1/2 ). 


(36) 


For T > T c , we may use expression (20) for a; equation (36) then gives 

§ - xt~ 1/(d ~ 2) (0 < t «; 1), (37) 


which means that ^ y> X. Now, if R y> £, equation (35) reduces to 

G(B) ^ X 2 (2 n ^)(d--i)l2R(d-\)l 2e /? ’ (38) 

which identifies £ as the correlation length of the system. Equation (37) then tells us that 
for, 2 < d < 4, the critical exponent v of the ideal Bose gas is 1 /(d — 2). At T=T C ,H is 
infinite; equation (35) now gives 


G(R) 


r{(d- 2 )/ 2 ) 

n (d-2)/2 x 2 R d-2’ 


(39) 


which shows that the critical exponent rj = 0. For T < T c , f continues to be infinite but 
now the condensate fraction, which is a measure of the long-range order in the system, is 
nonzero. The correlation function then assumes the form 

G(R) = I'I'ol 2 + (4°) 


where 


A(T) = 


r[(d- 2 )/ 2 j 

jr (d-2)/2 X 2 


(41) 


compare this result to the corresponding equation (13.5.7) of the spherical model. 

In the paper quoted earlier, Gunton and Buckingham also generalized the study of 
Bose-Einstein condensation to the single-particle energy spectrum e ~ k a , where a is a 
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positive number not necessarily equal to 2. They found that the phase transition, at a 
finite temperature T c , now took place for all d > a, and the critical exponents in the regime 
a < d <2a turned out to be 


d — 2a 
d — a 



Y = 


a 

d— a' 


d+a 
d — a' 


1 

v=— -, ;; = 2 — a. 

d — a 


(42) 


While mathematically correct, these results left one with the awkward conclusion that the 
Bose gas in its extreme relativistic state (cr = 1) was in a different universality class than the 
one in the nonrelativistic state (a — 2). It was shown later by Singh and Pandita (1983) that, 
if one employs the appropriate energy spectrum e — c-J(m 2 f 2 + h 2 k 2 ) and, at the same 
time, allows for the possibility of particle-antiparticle pair production in the system, as 
had been suggested earlier by Haber and Weldon (1981, 1982), then the relativistic Bose 
gas falls in the same universality class as the nonrelativistic one; see Problem (13.27). 


13.7 Other models 

In Section 13.4 we saw that a two-dimensional lattice model characterized by a discrete 
order parameter (.n — 1) underwent a phase transition, accompanied by a spontaneous 
magnetization mo, at a finite temperature T c ; naturally, one would expect the same if d 
were greater than 2. On the other hand, the spherical model, which is characterized by 
a continuous order parameter (with n — oo), undergoes such a transition only if d > 2. 
The question then arises whether intermediate models, with n = 2,3,..., would undergo 
a phase transition at a finite T c if d were equal to 2. The answer to this important question 
was provided by Mermin and Wagner (1966) who, making use of a well-known inequality 
due to Bogoliubov (1962), established the following theorem: 8 

Systems composed of spins with continuous symmetry (n > 2) and short-range inter¬ 
actions do not acquire spontaneous magnetization at any finite temperature T if the 
space dimensionality d <2. 

In this sense, systems with n > 2 behave in a manner similar to the spherical model — and 
quite unlike the Ising model! 

The marginal case (n — 2,d = 2), however, deserves a special mention. Clearly, this 
refers to an XY model in two dimensions, which has a direct relevance to superfluid He 4 
adsorbed on a substrate. As shown by Kosterlitz and Thouless (1972, 1973), this model 
exhibits a curious phase transition in that, while no long-range order develops at any 
finite temperature T, various physical quantities such as the susceptibility, the correla¬ 
tion length, the specific heat, and the superfluid density do become singular at a finite 
temperature T c , whose precise value is determined by point defects, such as vortices or 


For a review of this theorem and other allied questions, see (Griffiths, 1972, pp. 84-89). 
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dislocations, in the system. The correlation length §, as T -»■ T c +, displays an essential 
singularity, 

^^/CT-Tc)^ ) (1) 

where U is a nonuniversal constant; of course, the critical temperature itself is nonuni- 
versal and, for a square lattice, is given by 


K c =J/kT c ~ 1.12; (2) 

see Hasenbusch and Pinn (1997) and Dukovski, Machta, and Chayes (2002). The singular 
part of the specific heat shows a somewhat similar behavior, namely 

c (s) - r 2 o) 


which, measured against a regular background, is rather indetectable since every deriva¬ 
tive of the specific heat is finite at the critical point, yet the function is nonanalytic. The 
superfluid density behaves rather strangely; it approaches a finite value, as T -* T c — , pre¬ 
ceded by a square-root cusp. The correlation function is no different; at T — T c , Kosterlitz 
(1974) found a logarithmic factor along with a power law, namely 


[ln(r/a)] 1/8 
S(r) -^-, 


(4) 


while for T < T c we encounter a temperature-dependent exponent /; such that 




(5) 


where r/(T) kT /2nJ, for kT <^J, as shown by Berezinskii (1970) and ij(T c ) = 1/4, as shown 
by Kosterlitz (1974); see Berche, Sanchez, and Paredes (2002) for a numerical determi¬ 
nation of r)(T). This phase, with a power-law decay of correlations, is said to display 
quasi-long-range order. For further details of this transition, see Kosterlitz and Thouless 
(1978) and Nelson (1983); for a pedagogical account, see Plischke and Bergersen (1989), 
Section 5.E. 

For other exactly soluble models in two dimensions, see Baxter (1982), Wu (1982), 
Nienhuis (1987), and Cardy (1987), where references to other relevant literature on the 
subject can also be found. 

We now proceed to consider the situation in three dimensions. Here, no exact solutions 
exist except for the extreme case n=oo, which has been discussed in Section 13.5. However, 
an enormous amount of effort has been spent in obtaining approximate solutions which, 
over the years, have become accurate enough to be regarded as “almost exact.” Irrespective 
of the model under study, the problem has generally been attacked along three different 
lines which, after some refinements, have led to almost identical results. In summary, these 
approaches may be described as follows. 
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The method of series expansions 

In this approach, the partition function or other relevant properties of the system 
are expanded as a high-temperature series such as (13.4.5), with expansion parameter 
v— tanh(/!/), or as a low-temperature series such as (13.4.9), with expansion parameter 
w — exp(—2/3/); in the presence of an external field, one would have series with two expan¬ 
sion parameters instead of one. In either case, the first major task involves the numerical 
computation of coefficients, such as n(r) and m(r), on the basis of graph theory and other 
allied techniques, while the second major task involves analysis of these coefficients with 
a view to determining the location and the nature of the singularity governing the prop¬ 
erty in question. The latter task is generally accomplished by employing the ratio method, 
which examines the trend of the ratio of two consecutive coefficients, such as n(r) and 
n(r — 1), as r -»• oo, or by constructing Pade approximants which, in a sense, provide a con¬ 
tinuation of the known (finite) series beyond its normal range of validity up to its radius of 
convergence — thus locating and examining the nature of the relevant singularity. Since 
their inception (in the mid-1950s for the ratio method and the early 1960s for the Pade 
approximants), these techniques have been expanded, refined, and enriched in so many 
ways that it would be hopeless to attempt a proper review of them here. Suffice it to say that 
the reader may refer to Volume 3 of the Domb-Green series, which is devoted solely to the 
method of series expansions — in particular, to the articles by Gaunt and Guttmann (1974) 
on the asymptotic analysis of the various coefficients, by Domb (1974) on the Ising model, 
by Rushbrooke, Baker, and Wood (1974) on the Heisenberg model, by Stanley (1974) on 
the n-vector models, and by Betts (1974) on the XY model. For more recent reviews, see 
Guttmann (1989) and Baker (1990), where references to other relevant work on the subject 
are also available. 

The renormalization group method 

This method is based on the crucial observation (Wilson, 1971) that, as the critical point 
is approached, the correlation length of the system becomes exceedingly large — with the 
result that the sensitivity of the system to a length transformation (or renormalization) 
gets exceedingly diminished. At the critical point itself, the correlation length becomes 
infinite and, with it, the system becomes totally insensitive to such a transformation! The 
fixed point of the transformation is then identified with the critical point of the system, 
and the behavior of the system parameters such as K and h, see equation (13.5.2), in the 
neighborhood of the fixed point determines the critical exponents, and so on. Since very 
few systems could be solved exactly, approximation procedures had to be developed to 
handle most of the cases under study. 

One such procedure starts with known results for the upper critical dimension d = 
4 and carries out expansions in terms of the (small) parameter e — 4 — d, while the 
other starts with known results for the spherical model (n — oo) and carries out expan¬ 
sions in terms of the (small) parameter 1 /n\ in the former case, the coefficients of the 
expansion would be n-dependent, while in the latter case they would be d-dependent. 
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Highly sophisticated manipulations enable one to obtain reliable results for e that are 
as large as 1 (which corresponds to the most practical dimension d = 3) and for n as 
small as 1 (which corresponds to a large variety of systems that can be described through 
an order parameter that is scalar). We propose to discuss this method at length in 
Chapter 14; for the present, suffice it to say that the reader interested in further details 
may refer to Volume 6 of the Domb-Green series, which is devoted entirely to this topic. 

Monte Carlo methods 

As the name implies, these methods employ pseudorandom numbers to simulate statis¬ 
tical fluctuations for carrying out numerical integrations and computer simulations of 
systems with large numbers of degrees of freedom. Such methods have been in vogue for 
quite some time and, fortunately, have kept pace with developments along other lines of 
approach — so much so that they have adapted themselves to the ideas of the renormaliza¬ 
tion group as well (see Ma, 1976b). The interested reader may refer to Binder (1986, 1987, 
1992), Frenkel and Smit (2002), Binder and Heermann (2002), and Landau and Binder 
(2009). We propose to discuss computer simulation methods further in Chapter 16. 

As mentioned earlier, the aforementioned methods lead to results that are mutually 
compatible and, within the stated margins of error, essentially identical. Table 13.1 lists 
the generally accepted values of the critical exponents of a three-dimensional system with 
n — 0,1,2,3, and oo. It includes all the major exponents except a, which can be deter¬ 
mined by using the scaling relation a + 2fi + y = 2 (or the hyperscaling relation dv — 2 —a); 
we thus obtain, for n = 0,1,2,3, and oo, a — 0.235, 0.111, —0.008, —0.114, and —1, respec¬ 
tively — of course, with appropriate margins of error. The theoretical results assem¬ 
bled here may be compared with the corresponding experimental ones listed earlier in 
Table 12.1, remembering that, while all other entries there are Ising-like (n — 1), the case 
of superfluid He 4 pertains to n — 2. 

Table 13.1 includes exponents for the case n — 0 as well. This relates to the fact that if 
the spin dimensionality n is treated as a continuously varying parameter then the limit 
n-> 0 corresponds to the statistical behavior of self-avoiding random walks and hence of 
polymers (de Gennes, 1972; des Cloizeaux, 1974). The role of t in that case is played by the 
parameter 1 /N, where N is the number of steps constituting the walk or the number of 


Table 13.1 Theoretical Values of the Critical Exponents in Three Dimensions 



n = 0 

11= 1 

n = 2 

n = 3 

n = oo 

p 

0.302 ±0.004 

0.324 ±0.006 

0.346 ±0.009 

0.362 ±0.012 

0.5 

y 

1.161 ±0.003 

1.241 ±0.004 

1.316±0.009 

1.39 ±0.01 

2.0 

s 

4.85 ±0.08 

4.82 ±0.06 

4.81 ±0.08 

4.82 ±0.12 

5.0 

V 

0.588 ±0.001 

0.630 ±0.002 

0.669 ±0.003 

0.705 ±0.005 

1.0 

V 

0.026 ±0.014 

0.031 ±0.011 

0.032 ±0.015 

0.031 ±0.022 

0.0 


Source: After Baker (1990). 
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monomers constituting the polymer chain; thus, the limit N -*■ oo corresponds Lo t — 0, 
whereby one approaches the critical point of the system. Concepts such as the correlation 
function, the correlation length, the susceptibility, the free energy, and so on, can all be 
introduced systematically into the problem and one obtains a well-defined model that 
fits neatly with the rest of the family. For details, see de Gennes (1979) and des Cloizeaux 
(1982). 9 

Finally, we look at the situation with d > 4. As mentioned earlier, especially toward the 
end of Section 13.5, critical exponents for d > 4 are independent of n and have values as 
predicted by the mean field theory. To recapitulate, they are: 

a — 0, = —, y = 1, ^ = 3, v = —, i) = 0. (6) 

At the borderline dimensionality d — 4, two nonclassical features appear. First, the nature 
of the singularity is such that it cannot be represented by a power law alone; logarithmic 
factors are also present. Second, the dependence on n shows up in a striking fashion. In 
this context, we simply quote the results; for details, see Brezin et al. (1976): 


c®' 


(h = 0,t> 0) 

(7) 

m 0 - 

- |f| 1/2 |ln|f|| 3/(n + 8) 

(h = 0,t < 0) 

(8) 

X ' 

- r 1 |lnf| (n+2)/( "+ 8) 

s 

II 

p 

IV 

P 

(9) 

h- 

- m 3 |lnm| _1 

(t=0,h> 0), 

GO) 


along with the fact that rj — 0 and hence £ ~ / 1 ^ 2 . In the limit n -> oo, these results go over 
to the ones pertaining to the spherical model; see Section 13.5. 


Problems 

13 . 1 . Making use of expressions (12.3.17) through (12.3.19), (13.2.12), and (13.2.13), show that the 

expectation values of the numbers N + ,N-,N ++ , N _, andA/+_ in the case of an Ising chain are 

_ N P(P,B)±sinh(PfiB) 

± 2 P(P,B) 

N++ = 2D{ N B) e^ B [P(p,B) + smh(^B)}, 

= 2D(JS B ) ~ sinh (finB)] 

9 Values of n other than the ones appearing in Table 13.1 are sometimes encountered. For instance, certain antiferro¬ 
magnetic order-disorder transitions require for their description an order parameter with n = 4,6, 8, or 12; see Mukamel 
(1975), and Bak, Krinsky, and Mukamel (1976). Another example of this is provided by the superfluidity of liquid He 3 , 
which seems to require an order parameter with n = 18; see, for instance, Anderson (1984), and Vollhardt and Wolfle 
(1990). Even negative values of n have been investigated; see Balian and Toulouse (1973) and Fisher (1973). 
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and 


where 


Nj — = ■ 


N 




-m 


P(P,B) = {e' 4W + sinh 2 {ppB)} 1/2 


and 


D(fi,B) = P(p,B)[P(f},B) + cosh(/htB)]. 


Check that (i) the preceding expressions satisfy the requirement that N ++ + N + N + - = N and 

(ii) they agree with the quasichemical approximation (12.6.22), regardless of the value of B. 

13.2. (a) Show that the partition function of an Ising lattice can be written as 

Qn(B, T) = m gN(N+ , N + -) exp{ —PHn(N + , iV+_)), 

N+,N+- 

where 

H n (N+,N+.) = -jQqN-2N + ^-pB(2N+-N), 

while other symbols have their usual meanings; compare these results to equations (12.3.19) 
and (12.3.20). 

(b) Next, determine the combinatorial factor gw(N + ,N + -) for an Ising chain (q = 2) and show 
that, asymptotically, 

In g N (N +t N+J) « N+ In A/ 4 . + (N - N+) \n(N - N+) 



Now, assuming that In Qn ~ (the logarithm of the largest term in the sum J^'), evaluate the 
Helmholtz free energy A(B, T ) of the system and show that this leads to precisely the same 
results as the ones quoted in the preceding problem as well as the ones obtained in 
Section 13.2. 

13.3. Using the approximate expression, see Fowler and Guggenheim (1940), 


g N m,N 12 ) 


1 

{ l 2« N ) 

I! 

( N\\N 2 \\ 

N n \N 22 \[ 

'(> 12 )! 

r 

{ N\ ) 


for evaluating the partition function of an Ising lattice, show that one is led to the same results as 
the ones following from the Bethe approximation. 

[Note that, for q = 2, the quantity lng here is asymptotically exact; see Problem 13.2(b). No 
wonder that the Bethe approximation gives exact results in the case of an Ising chain.] 

13.4. Making use of relation (13.2.37), along with expressions (13.2.8) for the eigenvalues Ai and X 2 of 
the transfer matrix P, determine the correlation length $(K, T) of the Ising chain in the presence of 
a magnetic held. Evaluate the critical exponent v c , as defined in Problem 12.25, and check that 
v c = v/A, where v and A are standard exponents defined in Sections 12.10 and 12.12. 
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13.5. Consider a one-dimensional Ising system in a fluctuating magnetic field B, so that 

P2 N 


Qn(s,T )■ 


t BNB 2 N 1 

J dB^2 exp - ^ IPB-Bcri + PJoiOi+i] \, 

-00 i=1 ' 


with (Tjv+i = o"i. Note that when the system is very large (i.e., N » 1) the typical value of B is 
very small; nevertheless, the presence of this small fluctuating field leads to an order-disorder 
transition in this one-dimensional system! Determine the critical temperature of this transition. 

13.6. Solve exactly the problem of a field-free Ising chain with nearest-neighbor and 
next-nearest-neighbor interactions, so that 

H{cri} = -hJ2 <T ( or i+i —h'y'om+z, 

i i 

and examine the various properties of interest of this model. 

[Hint: Introduce a new variable t; = a>:cr,: + i = ±1, with the result that 

hflul = -Ji ^2ti-h^2 T ‘ T '+ 1 * 

i i 

which is formally similar to expression (13.2.1)] . 

13.7. Consider a double Ising chain such that the nearest-neighbor coupling constant along either 
chain is 7i while the one linking adjacent spins in the two chains is / 2 . Then, in the absence of the 
held, 

= -hJ2^m+l+crlal +1 ) -] 2 Yl, a i a i- 

i i 

Show that the partition function of this system is given by 

^lnQRa ^ln[2cosh.K2{cosh21Ci + •v/(l + sinh 2 2iCitanh 2 l<'2)}], 

where K\ = fh and K 2 = fj 2 . Examine the various thermodynamic properties of this system. 

[Hint. Express the Hamiltonian H in a symmetric form by writing the last term as 
— 1 Jailerja. +cr,: + 1 cP +1 ) and use the transfer matrix method.] 

13.8. Write down the transfer matrix P for a one-dimensional spin-1 Ising model in zero held, described 
by the Hamiltonian 

Efjvfo';} = -/ or, = -1,0,+1. 

i 

Show that the free energy of this model is given by 

^ [d + 2 cosh, 6 /) + V {8 + ( 2 cosh/i/ - l) 2 }] . 

Examine the limiting behavior of this quantity in the limits T -> 0 and T oc. 

13.9. (a) Apply the theory of Section 13.2 to a one-dimensional lattice gas and show that the pressure P 

and the volume per particle v are given by 

^ =!n^ {(y+l) + V[(y- l) 2 +4y>j 2 ]} 


n MT) = -kT In 


1 

v 


1 

2 



y - 1 

V[(y-D 2 + 4y>7 2 ] 


and 
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where y = zexp(4/3/) and = exp(—2,8/), z being the fugacity of the gas. Examine the high and 
the low temperature limits of these expressions. 

(b) A hard-core lattice gas pertains to the limit J -*■ — oo; this makes y -> 0 and jj -»• oo such that 
the quantity yrj 1 , which is equal to z, stays finite. Show that this leads to the equation of state 


P 

kT 


= In 


1 -P 

1-2 p 



13.10. For a one-dimensional system, such as the ones discussed in Sections 13.2 and 13.3, the 

correlation function g(r) at all temperatures is of the form exp(—ra/£), where a is the lattice 
constant of the system. Using the fluctuation-susceptibility relation (12.11.11), show that the 
low-field susceptibility of such a system is given by 


Xo=Nfifi 2 coth(a/2f). 


Note that as T -> 0 and, along with it, f -»■ oo, xo becomes directly proportional to £ — consistent 
with the fact that the critical exponent jj = 1 . 

For an «-vector model (including the scalar case n = 1), £ is given by equation (13.3.17), which 
leads to the result 


Xo = NPn 2 


I(n-2)/2(PJ) + In/2(PJ) 

hn-2)/2(PJ) ~ In/2 (PJ) 


Check that for the special case n = 1 this result reduces to equation (13.2.14). 

13.11. Show that for a one-dimensional, field-free Ising model 


a k a l a rn°n = {tanh/l/} n m+l k : 


where k < l < m < n. 

13.12. Recall the symbol n{r), of equation (13.4.5), which denotes the number of closed graphs that can 
be drawn on a given lattice using exactly r bonds. Show that for a square lattice wrapped on a 
torus (which is equivalent to imposing periodic boundary conditions) 

1 , 9 

n(4) = N, n(6) = 2N, n(8) = -N 2 + -N, _ 

Substituting these numbers into equation (13.4.5) and taking logs, one gets 


In Q(N, T)=N 


ln(2 cosh 2 K) + u 4 + 2v 6 +-v 8 + 


v= tanhfC 


Note that the term in N 2 has disappeared — in fact, all higher powers of N do the same. Why? 

13.13. According to Onsager, the field-free partition function of a rectangular lattice (with interaction 
parameters / and ]' in the two perpendicular directions) is given by 


71 71 

ln{cosh(2y) cosh(2y / ) — sinh(2y) cosw — sinh(2y') cos co'}dcodco' f 
o o 


iln«n = l» 2 +A 5 // 


where y = J/kT and y 1 = J'/kT. Show that if /' = 0, this leads to expression (13.2.9) for the 
linear chain with B = 0 while if/' = /, one is led to expression (13.4.22) for the square net. 
Focate the critical point of the rectangular lattice and study its thermodynamic behavior 
in the neighborhood of that point. 

13.14. Write the elliptic integral K\ (k) in the form 


jt/2 

*i(*)= f 


1 —Ksincf) 
V(1 — k 2 sin 2 (p) 


dcf) - 


tt/2 

1 


k sirup 


V(1 — a : 2 sin 2 0 ) 


dtp, 
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and show that, as k -> 1—, the first integral ln2 while the seconds ln[2/(l — /c 2 ) 1 / 2 ]. Hence 
K\{k) ~ ln(4/|/t'|), as in equation (13.4.34). 

[Hint: In the second integral, substitute cos cf> = x.] 

13.15. Using equations (13.4.22) and (13.4.28) at T = T c , show that the entropy of the two-dimensional 
Ising model on a square lattice at its critical point is given by 

^ ln2 - s/2K c ~ 0.3065; 

Nk n 2 


here, G is Catalan's constant, which is approximately equal to 0.915966. Compare this result with 
the ones following from the Bragg-Williams approximation and from the Bethe approximation; 
see Problem 12.15. 

13.16. The spontaneous magnetization of a two-dimensional Ising model on a square lattice at T <T C 
is given by equation (13.4.38a). Express this result in the formB|f|^{l + b\t\ -t—}, where B and p 
are stated in equation (13.4.40) while b = (1 — 9K c /^/2)/S. As usual, t= (T — T c )/T c < 0 and |t| <g 1. 

13.17. Apply the theory of Section 13.4 to a two-dimensional lattice gas and show that, at T = T c , the 
quantity kT c /P c v c ~ 10.35. 

13.18. Show that for the spherical model in one dimension the free energy at constant X is given by 


Mr _ 1 , |"/S{A.WO 2 -/ 2 )} 

N 2 2n 


Ph 2 b 2 
4 (x-jy 


while X is determined by the constraint equation 


1 

2fi\J(X 2 -J 2 ) 


fc 2 B 2 
4 (A.-/) 2 


In the absence of the field (B=0),X=y/l + 4p 2 J 2 /2p] the free energy at constant y is then given by 


PAs* _ 1 rV(l+4/l 2 / 2 ) + l 

N ~ 2 m 


1 -J(1 + AP 2 J 2 ). 


13.19. Starting with expression (13.3.8) for the partition function of a one-dimensional M-vector model, 
with li = nj', show that 


Lim —lnQ w («lif) 

n,N->cc nN 


1 

2 


s/(4K 2 + 1) - 1 - In 


s/(4K 2 + l) + l\ 

2 I 


where K = pj'. Note that, apart from a constant term, this result is exactly the same as for the 
spherical model; the difference arises from the fact that the present result is normalized to give 
Q n = 1 when K = 0. 

[Hint: Use the asymptotic formulae (for v 1) 

r(v) « (2nlv) l l 2 (v/eY 

and 

I v (vz) « (27ry)“ 1 ^ 2 (z 2 + 1 )~ 1 / 4 e'” 1 , 


where 

t)=-J{z 2 + 1) -\n[{J{z 2 + 1) + 1 }/z].] 

13.20. Show that the low-field susceptibility, xo, of the spherical model at T < T c is given by the 
asymptotic expression 

Xo « (N/i 2 /k B T) ■ Nmg(T), 

where mo(T) is the spontaneous magnetization of the system; note that in the thermodynamic 
limit the reduced susceptibility, /crT'/d/iV//. 2 , is infinite at all T < T c . Compare to Problem 13.26. 

















Problems 535 


13.21. In view of the fact that only those fluctuations whose length scale is large play a dominant role 
in determining the nature of a phase transition, the quantity {X — nk) in the expression for the 
correlation function of the spherical model, see equation (13.5.57), may be replaced by 

d 

X — /Xk = X — J cos (kja) ~ / 

/=i 

where <p = ( X/J) — d. Show that this approximation leads to the same result for G(R ) as we have in 
equation (13.5.61), with the same £ as in equation (13.5.62). 

For a similar reason, the quantity [exp(a + fie) - 1] in the correlation function (13.6.33) of the 
ideal Bose gas may be replaced by 

gV+fis _i^ a + p B = a + (pn 2 /2m)k 2 , 



leading to the same G(R) as in equation (13.6.35), with the same f as in equation (13.6.36). 10 

13.22. Consider a spherical model whose spins interact through a long-range potential varying as 

(a/r) d+a (cr > 0), r being the distance between two spins. This replaces the quantity (X — fj.^) of 
equations (13.5.16) and (13.5.57) by an expression approximating/^ + |(fcfl) cr ] for cr < 2 and 
/[</> + j ( ka ) 2 ] for a > 2 ; note that the nearest-neighbor interaction corresponds to the limit 
a -»■ oo and hence to the latter case. 

Assuming a to be less than 2, show that the above system undergoes a phase transition at a 
finite temperature T c for all d > cr. Further show that the critical exponents for this model are 

d — 2o 1 a d + o 

<*=-; -> £=-, y = 7i -’ S= H -> 

d — a 2 d — a d — cr 

1 

v=— -, t] = 2 — a 

d — o 

for a < d < 2 cr, and 

1 1 

a = 0, /J = —, y = 1, 8 = 3, v =—, ri = 2 — o 
2 a 


for d > 2a. 

13.23. Refer to Section 13.6 on the ideal Bose gas in d dimensions, and complete the steps leading to 
equations (13.6.9) through (13.6.15) and (13.6.23). 

13.24. Show that for an ideal Bose gas in d dimensions and at T > T c 


and 




(d*p 

yar 2 


\ Nk B 

did+ 2) g(d+2)/2(z) 

d g d / 2 (z) d 2 {gd/ 2 {z)} 2 g(d-i)i 2 (z) 

/ T 

V 

4 gd/2(Z) 

2 g{d-2)/2(Z) 4 lg(d-2)/2(Z)} 3 


/ B 2 fi\ _ks did-2) gd /2 (z ) 
V^/t> T 4 S(d-2)!Z(Z) 


d 2 {grf/ 2 (Z)} 2 g(rf-4)/2(Z) 
4 ig(d-2)/2(Z)} 3 


where /x(= kTlnz) is the chemical potential of the gas while other symbols have the same 
meanings as in Section 13.6. Note that these quantities satisfy the thermodynamic relationship 


VT 


d 2 P\ 

9T 2 ) 

/ v 


-NT 



= C V . 


10 A comparison with the mean field results (12.11.25) and (12.11.26) brings out a close similarity that exists between 
these models and the mean field picture of a phase transition; for instance, they all share a common critical exponent rj, 
which is zero. There are, however, significant differences; for one, the correlation length f for these models is character¬ 
ized by a critical exponent v which is nonclassical — in the sense that it is d-dependent whereas in the mean field case it 
is independent of d. 
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Also note that, since P here equals (2 U/dV), the quantities OP/dT) v and (3 2 P/3T 2 ) V are directly 
proportional to Cy and (3Cv/3T)y, respectively. Finally, examine the singular behavior of these 
quantities as T -> T c from above. 

13.25. Show that for any given fluid 

C P = VT(3P/dT)s(dP/dT) V K T 


and 


C v = VT(dP/dT)s(3P/dT)vKs, 

where the various symbols have their usual meanings. In the two-phase region, these formulae 
take the form 


C P = VT(dP/dT) 2 K T and C v = VT(dP/dT) 2 K S , 

respectively. Using the last of these results, rederive equation (13.6.30) for CV at T < T c . 

13.26. Show that for any given fluid 

K T = p~ 2 (3p/3p) T , 

where p(= N/V) is the particle density and p the chemical potential of the fluid. For the ideal 
Bose gas at T < T c , 

k bT t;(d/2 ) 

p _ Po + Pe ~- — + — — 

Using these results, show that 11 

KT*(V/k B T)( Po /p) 2 (T < T c )\ 

note that in the thermodynamic limit the reduced compressibility, kpTicpIv, is inhnite at all 
T < T c . Compare Problem 13.20. 

13.27. Consider an ideal relativistic Bose gas composed of Ni particles and N 2 antiparticles, each of rest 
mass mo, with occupation numbers 

1 a 1 

- and -, 

exp[yS(e-/xi)] -1 exp[yS(e —/r 2 )] — 1 

respectively, and the energy spectrum e = cV (p 2 + mjjc 2 ). Since particles and antiparticles are 
supposed to be created in pairs, the system is constrained by the conservation of the number 
Q (= A(i — A/ 2 ), rather than of N\ and N 2 separately; accordingly, p\ = —p 2 = p, say. 

Set up the thermodynamic functions of this system in three dimensions and examine the 
onset of Bose-Einstein condensation as T approaches a critical value, T c , determined by the 
“number density” Q/V. Show that the nature of the singularity at T = T c is such that, regardless 
of the severity of the relativistic effects, this system falls in the same universality class as the 
nonrelativistic one. 

13.28. Derive equation (13.1.9) for hard spheres in one dimension from equation (13.1.7). Plot the pair 
correlation function for nD = 0.25, 0.50, 0.75, and 0.90. Determine the structure factor S(k) 
numerically and plot it for the same densities. Compare your results with equation (13.1.21). 

13.29. Use the pair correlation function (13.1.8) and (13.1.9) to determine analytically the structure 
factor for hard spheres in one dimension. Show that S(k) is given by equation (13.1.21). Plot g(x) 
and S(k) for nD = 0.25, 0.50, 0.75, and 0.90. 

13.30. Use the Takahashi method of Section 13.1 for a system of point masses and harmonic springs of 
length a. Allow the particles to pass through each other, so that the partition function can be 
evaluated in a closed form. Show that the system is stable at zero pressure. Determine the average 


11 This remarkable relationship between the isothermal compressibility of a finite-sized Bose gas and the condensate 
density in the corresponding bulk system was first noticed by Singh and Pathria (1987b). 
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distance between particles that are far apart on the chain and the variance of that distance. 
Determine the structure factor and plot it for several values of the parameter mw 2 a 2 /kT, where 
m is the mass and ma> 2 is the spring constant. Show that the specific heat of this system is 
independent of temperature, as given by the equipartition theorem. 

13.31. Confirm the first few coefficients in the low-temperature series for the two-dimensional Ising 
model in equation (13.4.53). Write a program to calculate the energies of all 2 16 states for a 4 x 4 
periodic lattice. Show that the coefficients here are 

g q = (2, 0,32,64,424,1728,6688,13568,20524,13568,6688,1728,424,64,32,0,2). 

Extend your code to calculate the partition function for a 6 x 6 lattice, which has 2 36 states. 

13.32. Calculate the exact zero field partition function of the one-dimensional Ising model on a periodic 
chain of n spins using equation (13.2.5) and write Qn(0, T) in the form of equation (13.4.52). Show 
thatforx-*- 1, the partition function -»■ 2". Evaluate the microcanonical entropy S(q)/k = lng q 
and plot it for the case n = 16. 

13.33. Use the code posted atwww.elsevierdirect.com to evaluate equations (13.4.56) through (13.4.59) 
to determine the low-temperature series coefficients for the two-dimensional Ising model for an 
8 x8 lattice. Plot the internal energy and the specific heat as a function of temperature. Repeat 
your calculation for 16 x 16 and 32 x 32 lattices. 

13.34. Use the data posted at www.elsevierdirect.com to evaluate equations (13.4.56) through (13.4.59) 
to plot the two-dimensional Ising model internal energy and specific heat as a function of 
temperature for an Lx L lattice, where L = 64. Compare your results with the ones displayed in 
Figure 13.17. 



Phase Transitions: The 
Renormalization Group Approach 


In this chapter we propose to discuss what has turned out to be the most successful 
approach to the problem of phase transitions. This approach is based on ideas first pro¬ 
pounded by Kadanoff (1966b) and subsequently developed by Wilson (1971) and others 
into a powerful calculational tool. The main point of Kadanoff’s argument is that, as 
the critical point of a system is approached, its correlation length becomes exceedingly 
large — with the result that the sensitivity of the system to a length transformation (or 
a change of scale, as one may call it) gets exceedingly diminished. At the critical point 
itself, the correlation length becomes infinitely large and with it the system becomes totally 
insensitive to such a transformation. It is then conceivable that, if one is not too far from 
the critical point (i.e., |f|,h« 1), the given system (with lattice constant a) may bear a 
close resemblance to a transformed system (with lattice constant a' = la, where / > 1, and 
presumably modified parameters t' and h'), renormalized so that all distances in it are 
measured in terms of the new lattice constant a'\ clearly, the rescaled correlation length 

(in units of a') would be one-Zth of the original correlation length £ (in units of a). This 
resemblance in respect of critical behavior is expected only if f is also much larger than a!, 
just as £ was in comparison with a, which in turn requires that l (f/a); by implication, 
If'| and h' would also be <$: 1. 

These considerations lead to a formulation similar to the one presented in 
Section 12.10, with the difference that, while there we had to rely on a scaling hypothe¬ 
sis, here we have a convincing argument based on the role played by correlations among 
the microscopic constituents of the system which, in the vicinity of the critical point, are 
so large-scale that they make all other length scales, including the one that determines the 
structure of the lattice, essentially irrelevant. Unfortunately, Kadanoff’s approach did not 
provide a systematic means of deriving the critical exponents or of constructing the scal¬ 
ing functions that appear in the formulation. Those deficiencies were remedied by Wilson 
by introducing the concept of a renormalization group (RG) into the theory. 

We propose to discuss Wilson’s approach in Sections 14.3 and 14.4, but first we present 
a formulation of scaling along the lines indicated above and follow it with an exploration of 
simple examples of renormalization that pave way for establishing Wilson’s theory. Finally, 
in Section 14.5, we outline the theory of finite-sizing scaling that too has benefited greatly 
from the RG approach. 


Statistical Mechanics. DOI: 10.1016/8978-0-12-382188-1.00014-1 
© 2011 Elsevier Ltd. All rights reserved. 
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14.1 The conceptual basis of scaling 

The scale change in a given system can be effected in several different ways, the earliest 
one being due to Kadanoff who suggested that, when large-scale correlations prevail, the 
individual spins in the system maybe grouped into “blocks of spins,” each block consisting 
of l d original spins, and let each block play the role of a “single spin” in the transformed 
system; see Figure 14.1, where a block-spin transformation with / = 2 and d — 2 is shown. 
The spin variable of a block may be denoted by the symbol a', which arises from the values 
of the individual spins in the block, but a rule has to be established so that a' too is either 
+1 or —1, just as the original a,- were. 1 The transformed system then consists of IV' “spins,” 
where 


n' = r d N, (i) 

occupying a lattice structure with lattice constant a' = la. To preserve the spatial density of 
the degrees of freedom in the system, all spatial distances must be rescaled by the factor l, 
so that for any two “spins” in the transformed system 

r'= G 1 r. (2) 

A second way of effecting a scale change is to write down the partition function of the 
system, 

Q = ^exp[- PHnWi}], (3) 

hi) 



FIGURE 14.1 A block-spin transformation, with l — 2 and d = 2. The original lattice (a) has N (= 36) spins, the 
transformed one (b) has N' (= 9); after rescaling, the latter looks very much the same as the former, especially in 
the limit N,N’ —>■ oo. 

1 For simplicity, we employ the language of the scalar model here. 
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and carry out summation over a subset of (N — N') spins, such that one is left with a 
summation, over the remaining N' spins, which can (hopefully) be expressed in a form 
similar to (3), namely 


Q=Y2 ex pi-P H N'W{}]- ( 4 ) 

i°/) 

If the desired passage, from expression (3) to (4), can be accomplished with some degree 
of accuracy, we should expect a close resemblance between the critical behavior of the 
original system represented by equation (3) and the transformed one represented by (4); 
see Figure 14.2, where an example of this procedure with l = and d — 2 is shown. We 
note that this procedure forms the very backbone of the Wilson approach and is generally 
referred to as “decimation,” although the fraction of the spins removed, (N - N')/N, is 
rarely equal to 1 /10. Other ways of effecting a scale change will be mentioned later. 

It is quite natural to expect that the free energy of the transformed system (or, at least, 
that part of it that determines the critical behavior) is the same as that of the original sys¬ 
tem. The singular parts of the free energy per spin of the two systems should, therefore, be 
related as 




(5) 


so that 


x[r (s \t,h) = r d xlr (s \t',h'). 


( 6 ) 
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FIGURE 14.2 A scale transformation by "decimation," with l = and d = 2. The original lattice (a) has N (= 36) 
spins, the transformed one (b) has N' (= 18); the latter is yet to be rescaled (and rotated through an angle n/4) so 
that it looks very much the same as the former, especially in the limit N,N' -*■ oo. 



542 Chapter 14 • Phase Transitions: The Renormalization Group Approach 


Now, since both t and f are small in magnitude, one may assume that they are linearly 
related, that is, 


t J = l yt t, (7) 

where y t , as yet, is an unknown number. Similarly, one may assume that 

h' = mi, ( 8 ) 


with the result that 


f ( - s \t,h) = l d ^ s) (l yt t, l yh h); 


0 ) 


like y r , the number y h is also unknown at this stage of the game. 

We now assert that the function i/r w , which governs much of the critical behavior of 
the system, is essentially insensitive to a change of scale; we should, therefore, be able to 
eliminate the scale factor l from expression (9) . This essentially forces us to replace the 
variables t' and h' by a single, /-independent variable, namely 

h' h h ( Vh\ , , 

\t'\yh/yt = \t\vhivt = say 10 

at the same time, it requires us to write 

i l/ (s \t',ti) = \i\ dly, ty (h'/\t'\ A ), (11) 


leading to the identical result 


ip (s) (t,h) = \t\ d/yt ir(h/\t\ A y, (12) 

note that, as of now, the function i ]/ is also unknown. 2 Comparing (12) with equation 
(12.10.7), we readily identify the critical exponent a as 


a = 2 - (, d/y t ); (13) 

more importantly, the present considerations have led to the same scaled form for the free 
energy density of the system as was hypothesized in Section 12.10. We have thus raised the 
status of expression (12.10.7) from being a mere hypothesis to being a well-founded result 
whose conceptual basis lies in the prevalence of large-scale correlations in the system. As 
in Section 12.10, we infer that the exponents ft, y, and <5 are now given by 

P = 2-a-A = (d-y h )/y t , (14) 

y = -(2-a-2A) = (2y h -d)ly t , (15) 

2 Some authors derive equation (12) from (9) by choosing l to be . As will be seen shortly, see equation (19), this 
amounts to letting l be 0(J /a), which violates the requirement, l <g J /a, mentioned earlier. 
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and 


8 = A/p=y h /(d-y h ). (16) 

As remarked earlier, the rescaled correlation length §' of the transformed system and 
the original correlation length £ of the given system are related as 

a?) 


At the same time, we expect to be ~ \t\ v , just as £ ~ |f| v . It follows that 



Comparing (17) and (18), we conclude that 


and hence, by (13), 


V = 1/yt 


dv = 2 — a. 


(19) 


( 20 ) 


We thus obtain not only a useful expression for the critical exponent v but also the hyper¬ 
scaling relation (12.12.15) on a basis far sounder than the one employed in Section 12.12. 

Finally we look at the correlation functions of the two systems. At the critical point we 
expect that for the transformed system 

g(iV^) = i)<rV 2 )) ~ (r')-( rf - 2 +"), (21) 

just as for the original system 

g(ri,r 2 ) = <or(riMr 2 )> ~ r- (d ~ 2 +*>. (22) 

In order that equations (21) and (22) be mutually compatible, we must rescale the spin 
variables such that 

a'(r') = Z (rf_2+,,)/2 cr(r). (23) 

As for t], we may use the scaling relation y = (2 — rj)v, to get 

= d + 2 — 2 yh. (24) 


14.2 Some simple examples of renormalization 

14.2.A The Ising model in one dimension 

We start with the partition function (13.2.3a) of a closed Ising chain consisting of N 
spins, namely 

- N 

Qn(B, T) = ^exp ^2 {-Ko + Ki 0 iOi + \ + 2^2(0-/ + 

{or,) Ll=l 


(Kq = 0,Ki= PJ,K2 = 


(1) 
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FIGURE 14.3 A closed Ising chain to be "decimated" by carrying out summations over <72,04,- ■ ■■ 


the parameter Kq has been introduced here for reasons that will become clear in the sequel. 
For simplicity, we assume N to be even and carry out summation in (1) over all 04 for which 
i is even, that is, over 02 , 04 ,...; see Figure 14.3. Writing the summand in (1) as 

N l N 

nexpfKo + Knw+1 + \K 2 {oi + oi+i)} = ]~[exp{ 2 .Ko + Ki(o2j-i a 2j + a 2j a 2j+ 1) 

Z=1 ;'= 1 

+ jK2(P2j-\ +2rT 2 y +027+1)}, (2) 

the summations over a 2 j (— +1 or — 1 ) can be carried out straightforwardly, with the result 


hN 


I~[ ex P(2ATo) • 2coshj^i(or 2; -_i + o 2 j+i) + K 2 ) • exp \K 2 (a 2 j-\ + o 2 ]+ 1 ) 


7=1 


( 3 ) 


Denoting o 2 j ~1 by o'-, the partition function Qjv assumes the form 


i N 

Qn(B, T) = nexp(2^o) ■ 2cosh{Ki(oj + oj +1 ) + K 2 ) ■ exp! ^K 2 (oj + crj +1 ) 

W i=1 


( 4 ) 


The crucial step now consists in expressing (4) in a form similar to (1), namely 


Q n (B, T) = J2 ex P 


N 

Y2 {^0 + K\ a 'j a 'j +1 + 2 K 2 + a j+ 1 )} 


>1 


( 5 ) 
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This requires that, for all choices of the variables aj and o4 +1 , 

cx p{^o + K [a'jOj + 1 + \ K' 2 (a'j + oy +1 )} 

= exp(2JC 0 ) ■ 2cosh(Ki (aj + aj +1 ) + K 2 j ■ expj \K 2 (crj + oT +1 )} . (6) 

The various choices being aj = aj +1 = + l,eb = crj_ ] = -1 and a- = — o4 +1 = ±1, we obtain 
from (6) 

expCfiTg + K[+ K 2 ) = exp(2ko + K 2 ) • 2 cosh(2ki +K 2 ), (7a) 

expOKTg + K[ — K 2 ) = exp(2ko — K 2 ) ■ 2 cosh(2ki — kT 2 ), (7b) 

and 


expiK^ - K[) = exp(2kr 0 ) ■ 2 coshisT 2 . (7c) 

Solving for K ' {) , K[, and K' v we get 

e K o = 2e 2K ° (cosh( 2 i<ri + K 2 ) cosh( 2 ^Ti — K 2 ) cosh 2 K 2 \ x ^, ( 8 a) 

e K i = {cosh( 2 _RTi + kr 2 )cosh( 2 kTi — kr 2 )/cosh 2 kT 2 }b 4 , ( 8 b) 


and 


e K 2 — e K 2 {cosh( 2 ^i + kr 2 )/cosh( 2 kTi — kr 2 )}b 2 . 


( 8 c) 


We may now remark on the need to have the parameter K 0 included in expression (1) 
and, accordingly, K' {) in (5). Since we end up having three equations (7a), (7b), and (7c), to 
determine the parameters appropriate to the transformed system, the problem could not 
be solved with K\ and K 2 only; thus, even if K 0 were set equal to zero, a K' {) ^ 0 is essential 
for a proper representation of the transformed system. To highlight the role played by this 
parameter in determining the free energy of the given system, we observe on the basis of 
equations (1) and (5) that, with Kq = 0, 

Qn(KuK 2 ) = e N ' K oQ N i (K\,K' 2 ) (9) 

and hence for the free energy we have (in units of kT ) 


A N (K l} K 2 ) = —N'Kq +A n ,(K[,K! 2 ). 


( 10 ) 


Since N' = j N, we obtain for the free energy per spin the recurrence relation 


f(K u K 2 ) = -\K' 0 +lj(K' v K^, 


( 11 ) 
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which relates the free energy per spin of the given system with that of the transformed 
system; the role played by Kq is clearly significant. For example, in the limit T —»• oo, when 
(Ki,K 2 ) and along with them ( K{, K 2 ) tend to zero, equations ( 8 a) and ( 11 ) give 

/(0,0) = -Kq = -ln2, (12) 

which is indeed the correct result (arising from the limiting value of the entropy, MTn 2 , of 
the system). 

We note that the parameter Kq does not appear in equations ( 8 b) and ( 8 c), which deter¬ 
mine K[ and K ' 2 in terms of K\ and K2. As will be seen in Sections 14.3 and 14.4, it is these 
two relations that determine the singular part of the free energy of the system and hence 
its critical behavior; the parameters Kq and K' {) affect only the regular part of the free energy 
and hence play no direct role in determining the critical behavior of the system. We might 
hasten to add that, while renormalization is generally used as a technique for studying the 
properties of a given system in the vicinity of its critical point, it can be useful over a much 
broader range of the variables K\ and K 2 . For instance, in the absence of the field (K 2 — 0), 
equations ( 8 ) and ( 11 ) give 

Kq = ln{2y''[cosh(2!(Ti)]}> K[ = ln^/[cosh(21<ri)]. K 2 = 0 (13) 


and hence 


f(Ki,0) = —j ln{2 x /[cosh(2^Ti)]) + 2/(ln^[cosh(2ATi)],0); (14) 

the functional equation (14) has the solution 

f(K\,0) = — ln(2cosh£i), (15) 

valid at all K\. On the other hand, in the paramagnetic case (K] = 0), we get 

Kq = ln(2cosh^2)> = 0, K 2 = K 2 (16) 

and hence 


f(0,K 2 ) = — j ln(2coshl<r2) + 2 f(0,K 2 ), 


(17) 


with the solution 


f(0,K 2 ) = — ln(2cosh7G). (18) 

valid at all K 2 . The case when both K] and K 2 are present is left as an exercise for the reader; 
see Problem 14.2. The critical behavior of this system will be studied in Section 14.4. 
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14.2.B The spherical model in one dimension 

We adopt the same topology as in Figure 14.3 and write down the partition function of the 
one-dimensional spherical model consisting of N spins, see equation (13.5.12a), 

N 

E {^0 + Kiajffi+i + K 2 oi - Aof J j d<7\ ... dcrjy 
,i= 1 

(Ko = 0,K l = PJ,K 2 = PiiB, A = Pk), (19) 

where A is chosen so that 

(l= = N ; (20) 

see equations (13.5.9) and (13.5.13). Assuming N to be even, we carry out integrations over 
02 , 04 ,.... For this, we write our integrand as 

N 2 N 

]~[expjXo + K\ai<Ji + \ +K 2 Oi - A of J = ]"~[exp|21Co + K\(o 2 j-\o 2 j + o 2 jcr 2 j + i) 

1=1 j= l 

+ K 2 (o 2 j_i + o 2j ) - A (a£j-i + a 2j )} ( 21 ) 



and integrate over o 2 j, using the formula 


OC 

/ 


e -p x "+q x dx — { e T/ 4 P (jp > 0 ) 


( 22 ) 


After simplihcation, we get 


IN 


ft(x)' exp j ( 2jKo +S) + ^i fT 2 J-' ( 72 J+' + ( Kz +) a2] ~ 


t=i 


1 C 


^ \ _ ! — f A — (hi ] a 2 . 


2A I U2 i~ l \' 


(23) 


Denoting er 2 /_i by oh expression (19) may now be written in the renormalized form 


Qn = 


OO OO 

-/ / 

— OO —OO 


exp 


AT' 


El^o+^+i+^-AV 2 ) 

. 1=1 


dcf -^... dcjf 


(24) 
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where 


K 


1 / n \ Ky 

= -ln(-)+2is:o + —, K = 

2 VA/ 4A 1 


K[ 

2 A’ 


K'=K 2 


Ki 

1 + T 


/ K \ 

A' = A- 

2A 


and, of course, N' = ^N. It follows that, with Kq = 0, 

Q w (^i,^2,A) = e N ' K oQ Nr (K[ t K2,A') 


(25a, b) 
(25c, d) 


(26) 


and hence for the free energy per spin (in units of kT ) we have the recurrence relation 

f(K u K 2 ,A) = -i^ + A'). (27) 

The critical behavior of this system will be studied in Section 14.4. Presently we would 
like to demonstrate how the free energy of the system, over a broad range of the vari¬ 
ables K\ and K 2l can be determined by using the recurrence relation (27) along with the 
transformation equations (25). 

First of all we identify two invariants of the transformation, namely 

A' 2 - K[ 2 = A 2 -K\ (28a) 


and 


(A' - K[)m = (A - Ki)/K 2 . 


(28b) 


It turns out that it is precisely these combinations that appear in the constraint equation 
of the system as well; see Problem 13.18. It follows that the constraint equation (20) is RG- 
invariant, that is, once it is satisfied in the original system, its counterpart 


N ' 

E°f ) =Ar 

;=i 


(29) 


is automatically satisfied in the transformed system — without any need to rescale the spin 
variables. 3 Now, in the absence of the field (K 2 — 0), equations (25) and (27) give 


1 7T 

JG = -In('-'), K[ = ^, K'= 0, A'= 
0 2 VA / 1 2A 2 


and hence 


„ i, /n \ i Ik 2 

/«.A>=-ji„(x)V hi' A - 


= A — 


K\ 


Ki 

2A 


2A 


(30) 


(31) 


3 This is further related to the fact that the critical exponent rj in this case is equal to 1; see equation (14.1.23) which, 
with d = 1, gives: <r'(r') = a(r). 
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The functional equation (31) has the solution 


f(K\, A) — - In 


A + V(A 2 -k\) 


2tt 


(32) 


valid at all K \; the appropriate value of A is given by the constraint equation 

— =-^-=- = l, that is A=-J(l + 4Kh. (33) 

9A 2J{A 2 -K\) 2 vv v 

Note that the invariant (28a) in this case is equal to On the other hand, in the 
paramagnetic case (Ki = 0), we get 

K ° = lHj) + l ^ = 0 ’ K * = K2 ’ A ' = A (34) 

and hence 

m , A>—+ (35) 

with the solution 


m, A) = -ilng)- 


A 

4A ’ 


(36) 


valid at all K 2 ) the appropriate value of A is now given by 


9/ 

9A 


1 


Ki 


2A 4A 2 


= 1, that is A = 


\/(l + 4^1) T 1 


(37) 


The case when both K\ and K 2 are present is left as an exercise for the reader; see 
Problem 14.3. 


14.2.C The Ising model in two dimensions 

As our third example of renormalization, we consider an Ising model on a square 
lattice in two dimensions. In the field-free case, the partition function of this system is 
given by 


Qn(T) — ^ ex p| ^^Kaicrj > (. K = PJ ), (38) 

(cr;) In.n. J 

where the summation in the exponent goes over all nearest-neighbor pairs of spins in the 
lattice. Writing the summand in (38) as a product of factors pertaining to different pairs of 
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FIGURE 14.4 A section of the two-dimensional Ising lattice. The summed-over spins are denoted by open circles, the 
remaining ones by solid dots. To begin with, we concentrate on the summation over < 75 . 


spins, we may highlight those factors that contain a particular spin, say crs, and carry out 
summation over this spin (see Figure 14.4): 

e n e Kn 2 (T 5 . gKcr^ gKcr^crg gKcrscrg 

cr 5 =±in.n. 

= ]~[ ...[2C0sh^(ff2 + ff4 + ff6 + ff8)]---, 
n.n. 

where the primed product goes over the remaining nearest-neighbor pairs in the lattice. 
This procedure of summation is supposed to be continued until one-half of the spins, 
shown as open circles in Figure 14.4, are all summed over. Clearly, this will generate a host 
of factors of the type shown in expression (40) but the real task now is to express these fac¬ 
tors in a form similar, or at least as close as possible, to the factors appearing in the original 
expression (38); moreover, this mode of expression should be valid for all possible values 
of the remaining spins, namely 02 ,( 74 ,... = ± 1 . 

For the factor explicitly displayed in (40), there are 16 possible values for the spins 
involved, of which only four turn out to be nonequivalent; they are 


(i) a 2 = 04 = = <78, 

(41a) 

(ii) o 2 = (74 = erg = — 0 - 3 , 

(41b) 

(iii) (72 = (7 4 = -(7 6 = -ff 8 , 

(41c) 

(iv) (72 = -(74 = -cr 6 = (78. 

(4 Id) 


(39) 

(40) 


However, even four values are too many to accommodate by a factor of the form 

exp (A + B(o 2 <H + + rr e a 8 )}, 
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which contains only nearest-neighbor interactions in the transformed lattice and hence 
only two parameters to choose. Clearly, we need two more degrees of freedom, and it turns 
out that these are provided by the next-nearest-neighbor interactions and by interactions 
among a quartet of spins sitting on the corners of an elementary square in the new lattice. 
Thus, we are obliged to set 

2 coshl<r((r2 + 0-4 + + 0's) = expl^Tg + \K\o204 + 020$ + a40s + 060s) 

+ L'(0208 + CT4CT6) + M'02040608 ); (42) 

the reason why we have written \K\ rather than K', will become clear shortly. Now, the 
four possibilities listed above require that 


with the result that 


2 cosh4_KT = exptfifg + 2 K' + 2 L' + M'), 

(43a) 

2cosh2C = expOKTo — M'), 

(43b) 

2 = exp(^Tg — 2 L! + M'), 

(43c) 

2 = exp(^g - 2 K' + 2 L'+M'), 

(43d) 

Kq = In 2 + 1 lncosh2^C + g lncosh4^t, 

(44) 

K' = | lncosh4^f, 

(45) 

L 1 = g lncosh41C, 

(46) 

M' = g lncosh4^T — ^ lncosh2.K'. 

(47) 


Continuing this process, we find that the factor exp (^K'a 204 ) appears once again when 
summation over o\ is carried out, the factor exp (\ K’ 020 c) appears once again when sum¬ 
mation over <73 is carried out, and so on; no further factors involving the products 02 ^ 8 , 0-106 
and < 72 ^ 4 ( 76^8 appear. The net result is that the partition function (38) assumes the form 


Q.v = c iV hi exp l K’^OjO^ + 1! 

{°p l nn - 


E 

n.n.n. 


0 j 0 


t+^'E 

sq. 


o 1 n 


J k l m I 


(48) 


where N' = jlV. 

Clearly, we have not been able to establish an exact correspondence between the 
transformed system and the original one (in which no interactions other than the nearest- 
neighbor ones were present). It seems more reasonable now that we redefine the original 
system as one having all the interactions appearing in expression (48), but with L = M = 0. 
We may then write 


Q n (K, 0 , 0 ) = e^' K o Q N / (K' ,L' ,M') 


( 49 ) 
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and hence for the free energy per spin (in units of kT) 

= (50) 

where K^,K',L', and M' are given by equations (44) through (47). In view of the appear¬ 
ance of new parameters, L' and M', in the recurrence relation (50), no further progress can 
be made without introducing some sort of an approximation, for which see Section 14.4. 
But one thing is clear: when renormalization is carried out in two dimensions or more, the 
connectivity of the lattice requires that the Hamiltonian of the decimated system contain 
some higher-order interactions not present in the original system, in order that the lat¬ 
ter be represented correctly on transformation. It is obvious that further renormalizations 
would require more and more such interactions, and hence the need for more and more 
parameters would grow without limit. It may then be advisable that the Hamiltonian of 
the given system be regarded as a function of an “infinitely large number of parameters” 
(all but a few of which are zero to begin with), such that the number of parameters with 
a nonzero value grows indefinitely as renormalization transformations are carried out in 
succession and the number of degrees of freedom of the system steadily reduced. 

We now present a formulation of the renormalization group approach to the study of 
critical phenomena. 


14.3 The renormalization group: general formulation 

We start with a system whose Hamiltonian depends on a large number of parameters 
K\ , K2 ,... (all but a few of which are zero to begin with) and on the spin configuration {ah 
of the lattice. The free energy of the system is then given by 

exp{-pA) = J2 ex P^-hH[ai}({Ka})] a = 1,2,.... (1) 

tm) 

We now effect a “decimation” of the system, which reduces the number of degrees of 
freedom from N to N' and the correlation length from § to such that 

N' = r d N, S' = r'$ (/ > 1). (2a, b) 


Expressing the Hamiltonian of the transformed system in a form similar to the one for the 
original system, except that we now have new parameters^, along with the additional K’ Q , 
and new spins aj, equation (1) takes the form 


exp (-PA) = exp(TVWg) ^ exp ~pH {(r >, ({*£}) 


(3) 


so that the free energy per spin (in units of p 1 ) is given by 

f{{K a )) = r d {-K' 0 +f{{K' a ))}. 


( 4 ) 
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We now look closely at the transformation {IQ} {K' a } by introducing a vector space K 
in which the set of parameters IQ is represented by the tip of a position vector K; on trans¬ 
formation, K changes to K ', which may be looked upon as a kind of “flow from one position 
in the vector space to another.” This flow may be represented by the transformation 
equation 

K = tRi(K) (5) 

where IR.\ is the renormalization-group operator appropriate to the case under considera¬ 
tion. A repeated application of this process leads to a sequence of vectors K', K ",..., such 
that 

K w = tRi(K< n - ly ) = ... = tR.’{(K (0) ) n = 0,1,2,..., (6) 

where K {0) denotes the original K. At the end of this sequence, the correlation length £ and 
the singular part of the free energy f s are given by 

? (n) = rV 0 ’, fs n) = i nd fs 0) ; (7a, b) 

see equations (2b) and (4) . 

Now, the transformation (5) may have a fixed point, K *, so that 

St t {K*)=K*. (8) 

Equation (2b) then tells us that which means that £(. K*) is either zero or 

infinite! The former possibility is of little interest to us, so let us dwell only on the latter 
(which makes the system with parameters K = K* critical); in simple situations, the fixed 
point, K* , will correspond to the critical point, K c , of the given system. Conceivably, an 
arbitrary point K, on successive transformations such as (6), may end up at the fixed point 
K*. Since the correlation length § can only decrease on transformation, see equation (7a), 
and is infinite at the end of this sequence of transformations, it must be infinite at K as well 
(the same being true for all points intermediate between K and K*). The collection of all 
those points which, on successive transformations, flow into the fixed point, constitutes a 
surface over which £ is infinite; this surface is generally referred to as the critical surface. All 
flow lines in this surface are directed toward, and terminate at, the fixed point, while points 
off this surface may initially move toward the fixed point but eventually their flow lines will 
veer away from it; see Figure 14.5. Reasons behind this pattern of flow will become clear 
soon. 

For the analysis of the critical behavior we examine the pattern of flow in the neighbor¬ 
hood of the fixed point K *. 4 Setting 


K = K* + k, (9) 

4 In general, the vector K* will contain components not present in the original problem. In such a case, one has to 
locate, on the critical surface, a point K c that is free of these “unnecessary” components; since f is infinite at K = K C 
as well, the latter may be identified as the critical point of the given system. As will be seen in the sequel, the critical 
behavior of the system is still determined by the flow pattern in the neighborhood of the fixed point. 


554 Chapter 14 • Phase Transitions: The Renormalization Group Approach 



FIGURE 14.5 The parameter space of a physical system, showing critical trajectories (solid lines) flowing into the 
fixed point. The subspace of the relevant variables, t and h, is everywhere "orthogonal" to these trajectories (on all 
of which f = h = 0); the critical trajectories differ from one another only in respect of irrelevant variables that 
vanish as the fixed point is approached. The dashed lines depict that part of the flow in which the relevant 
variables play a decisive role. 


we have by equations (5) and (8) 


so that 


K = K* + k' = &i{K* + k) 


GO) 


k' = + k)-K* (11) 

Assuming {k a }, and hence {k f a }, to be small, we may linearize equation (11) to write 

k=A*,k (12) 

K=K* 

where A* is a matrix arising from the linearization of the operator fR\ around the fixed 
point K*. The eigenvalues /,,■ and the eigenvectors 0, of the matrix A] play a vital role in 
determining the critical behavior of the system. 

If the vectors </>,- form a complete set, we may expand 1c and k' in terms of </»,, 

k — M/0/, k' = ^2 «/0/> (13a, b) 

i i 

with the result that 


k 


cURi 

Hk 


u f - = XfUi /'= 1,2,...; 


( 14 ) 
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the coefficients iq appearing here are generally referred to as the scaling fields of the prob¬ 
lem. Under successive transformations (all in the neighborhood of the fixed point), these 
fields are given by 




( 15 ) 


It is obvious that the fields iq are certain linear combinations of the original parameters k a ; 
they may, therefore, be looked upon as the “generalized coordinates” of the problem. The 
relative influence of these coordinates in determining the critical behavior of the system 
depends crucially on the respective eigenvalues A.,-. With m- given by (15), we have three 
possible courses for a given coordinate zq. 


(a) If A/ > 1, the coordinate zq grows with n and, with successive transformations, 
becomes more and more significant. Clearly, zq in this case is a relevant variable 
which, by itself, drives the system away from the fixed point — thus making the 
fixed point unstable. By experience, we know that the temperature parameter 

t [— (T — T c )/T c ] and the magnetic field parameter h [= /iB/kT c \ are two basic 
quantities that vanish at the critical point and are clearly relevant to the problem of 
phase transitions. We, therefore, expect that our analysis will produce at least two 
relevant variables, zq and u 2 say, which could be identified with t and h, respectively, 
so that 

u\=at+0(t 2 ), U 2 = bh + 0(h 2 ), ( 16 , 17 ) 

with both Xi and X 2 greater than unity. 

(b) If )-i < 1, the coordinate zq decays with n and, with successive transformations, 
becomes less and less significant. Clearly, zq in this case is an irrelevant variable 
which, by itself, drives the system toward the fixed point. Now, if all the relevant 
variables are set at zero, then successive transformations (by virtue of the irrelevant 
variables) will drive the system to the fixed point. We must then be cruising on the 
critical surface itself (where all trajectories are known to flow into the fixed point). 

It follows that on the critical surface all relevant variables of the problem are zero; 
furthermore, the divergence of the correlation length is also tied to the same fact. 

(c) If Xi — 1, the coordinate zq, in the linear approximation, stays constant; it neither 
grows nor decays very rapidly unless one enters the nonlinear, beyond-scaling, 
regime of the variable zq. Quite appropriately, zq in this case is termed a marginal 
variable; it does not affect the critical behavior of the system as significantly as a 
relevant variable does, but it may throw in logarithmic factors along with the 
conventional power laws. The ability to identify such variables and to predict the 
consequent departures from simple power-law scaling is one of the added virtues 
of the RG approach. 


The above considerations enable us to understand the pattern of flow shown in 
Figure 14.5. While the points on the critical surface flow into the fixed point, those off this 
surface flow toward the fixed point by virtue of the irrelevant variables and, at the same 
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time, away from it by virtue of the relevant variables; the net result is an initial approach 
toward but a final recession away from the fixed point. Points close to the fixed point and 
in directions “orthogonal” to the critical surface have only relevant variables to contend 
with, so right away they move away from the fixed point. It is this part of the flow that 
determines the critical behavior of the given system. 

Disregarding the irrelevant variables, we now examine the manner in which the corre¬ 
lation length, £, and the (singular part of the) free energy, f s , of the system are affected by 
the transformation (15). In view of equations (7a,b), we have 

£(wi,«2 ,...) = (18) 

and 

fs(U\,U 2 ,...) = r nd fs&iUi,X%u 2 ,...). (19) 

Identifying u\ with t, as in (16), and remembering the definition of the critical exponent v, 
we obtain from (18) 


u~ v = 


( 20 ) 


with the result that 


v = lnZ/lnG. ( 21 ) 

At first sight one might wonder why v should depend on 1. In fact, it doesn’t because of the 
following argument. On physical grounds we expect that two successive transformations 
with scale factors l\ and l 2 would be equivalent to a single transformation with scale factor 
Zi Z 2 , that is , 5 


*h**k = *hk- 

This forces the eigenvalues Aj to be of the form l y ‘, for 

%% = (hk) yi . 


Relation (21) then becomes 


v = l/yi, 


( 22 ) 


(23) 


(24) 


manifestly independent of l. 

Equation (19) may now be written as 

f s (t, h, ...) = r nd f s (l m t, l ny2 h, (25) 

5 This requirement makes the set of operators St; a semigroup — not a group because the inverse of Sti does not exist. 
The reason for the nonexistence of tRJ 1 is that once a number of degrees of freedom of the system are summed over 
there is no definitive way of recreating them. 
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To ensure that the above relationship is independent of the choice of Z, we follow the same 
line of argument as in Section 14.1, after equation (14.1.9), with the result 

fs(t,h,...) = \ t\ dv f s (h/\t\ A (26) 


where 


A=y2/yi- (27) 

Equation (26) is formally the same as the scaling relationship postulated in Section 12.10 
and, one might say, argued out in Section 14.1. The big difference here is that not only has 
this relationship been derived on a firmer basis but now we also have a recipe for evaluat¬ 
ing the critical exponents v, A, and so on, from first principles. What one has to do here is 
to determine the RG operator tR./ for the given problem, linearize it around the appropri¬ 
ate fixed point K* , determine the eigenvalues (= l y ‘) and use equations (24) and (27) to 
evaluate v and A. At the same time, recalling the definition of the critical exponent a, we 
infer from (26) that 


2 — a = dv; (28) 

the remaining exponents follow with the help of the scaling relations 

P = ( 2-aO-A, y — 2A — (2 — a), 8 = A/P, /? = 2 — (y/v). (29) 

We find that the hyperscaling relation (28) is an integral part of the RG formulation; 
it comes out naturally — with no external imposition whatsoever. It is, however, dis¬ 
concerting that, according to the above argument, this relation should hold for all d — 
notwithstanding the fact that, for d > 4, all critical exponents are “stuck” at the mean field 
values and relation (28) gets replaced by 

2 — a = 4v (d> 4), (30) 

with a = 0 and v—\. The reason for this peculiar behavior is somewhat subtle; it 
arises from the fact that in certain situations an “irrelevant variable” may well raise its 
“dangerous” head and affect the outcome of the calculation in a rather significant manner. 

To see how this happens, we may consider a continuous spin model, very much like the 
one considered in Section 13.5, with the probability distribution law 

p(apdai = const. e~2 a i- ua i doi (u > 0); (31) 

compared with equation (13.5.1). The free energy of the system then depends on the 
parameter u as well as on t and h, and we obtain instead of (25) 


f s (t, h, U) = r nd fs (l ny ' t, l ny 2 h, l ny 3 U, . . .) . 


( 32 ) 
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Now, using the RG approach, one finds (see Appendix D of Fisher, 1983) that, for d > 4, 

yi=2, yz — -d+l, y 3 = 4 — d, (33) 

showing very clearly that, for d > 4, the parameter u is an irrelevant variable. One is, 
therefore, tempted to ignore u and arrive at equation (26), with 

v = ^ and A = (d + 2)/4. (34) 

The very fact that A turns out to be d -dependent shows that there is something wrong 
here. It turns out that though, on successive transformations, the variable u does tend to 
zero, its influence on the function f s does not. It may, therefore, be prudent to write 

f s (t,h,u) = \t\ d "f s Oi/\t\ A ,u/\t\^), (35) 


where 


Now, by analysis, one finds that 



4 — d 
2 


(36) 


Limfite, w) sw — Fivw 1 ^ 2 ), (37) 

w^o w 

where F(vw 1 ^ 2 ) is a perfectly well-behaved function. The singularity of/ s in w changes the 
picture altogether, and we get in the desired limit 

f s (t,h,u) ^ \t\ dv+<l, F(h/\t\ A+ yy (38) 

The “revised” value of A now is 


d + 2 4 — d 

Arev = — -1-:— : 


3 

2 ’ 


(39) 


which is indeed independent of d and agrees with the corresponding mean field value. At 
the same time, the “revised” form of the hyperscaling relation now is 


2-a = 



4 — d 
2 


= 2 , 


(40) 


as stated in (30). 

The lesson to be learnt here is that the standard derivations of the scaling relations 
rest on certain assumptions, often left unstated, about the nonsingular or nonvanishing 
behavior of various scaling functions and their arguments. In many cases these assump¬ 
tions are valid and may even be confirmed by explicit calculations or otherwise; in certain 
circumstances, however, they fail — in which case a scaling relation may change its form. 
Fortunately, such circumstances are not that common. 
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14.4 Applications of the renormalization group 

We start our considerations with the models examined in Section 14.2. 

14.4.A The Ising model in one dimension 

The renormalization group transformation in this case is given by equations (14.2.8b and 
c), namely 


K[ = \ ln[coshf2A'i + K 2 ) cosh(2AT] — K 2 )] — \ In cosh K 2 (la) 


and 


K / 2 = K2 + j ln[cosh(2.Kfi + K2) / cosh(2.Ki — K2)], (lb) 

where Ki — J/kT and K 2 — iiB/kT. It is straightforward to see that this transformation has 
a “line of trivial fixed points,” with K\ = 0 and K 2 arbitrary. These fixed points pertain to 
either / = 0 or T = 00 ; in either case, one has a correlation length that vanishes. There is 
also a nontrivial fixed point at K\ = 00 and K 2 = 0, which may be realized by first setting 
B — 0 and then letting T —> 0; the correlation length at this fixed point will be infinite. In 
the vicinity of this point, we have 

K[ ~Ki - iln2, K2 — 2 K 2 . (2a, b) 

Now, since K j = 00 , K\ is not an appropriate variable to carry out an expansion around the 
fixed point. We may adopt instead a new variable, see equation (13.2.17), namely 

t = exp(-pKi) (p > 0), (3) 

so that t* = 0; now, in the vicinity of the fixed point, we have 


t'~2 ?,/2 f. (4) 

Identifying K 2 as the variable h, and remembering that the scale factor l here is 2, we readily 
obtain from equations (2b) and (4) 


yi=P/ 2 , y 2 = 1. (5) 

The critical exponents of the model now follow straightforwardly from (5); we get 


v = 2/p, A = 2/p, 


( 6 ) 


from which, by equations (14.3.28) and (14.3.29), 


a = 2 — 2 /p, ft = 0 , y = 2 /p, 8 = 00 , 11 = 1 


(7) 


560 Chapter 14 • Phase Transitions: The Renormalization Group Approach 


in complete agreement with the results found in Section 13.2. As for the choice of p, see 
remarks following equation (13.3.21). 


14.4.B The spherical model in one dimension 

The RG transformation in this case is given by equations (14.2.25b, c, and d), namely 


K[ = 


Kf 
2 A’ 


*5 = K 2 ( 1 + ^ ) > A' = A - 


Kf 
2A ’ 


(8a, b, c) 


where K\ — J/kT,K 2 — pB/kT, and A — X/kT,X being the “spherical field’’ that was 
employed to take care of the constraint on the model. The nontrivial fixed point is again 
at T — 0, where X —J [see equation (13.5.24), with d = 1] and hence A = K\. Equations (8) 
then reduce to the linearized form (valid for small T) 


K[ — 2 K \ y K^2K 2 , A' ~ |a. (9a, b, c) 

Equations (9a) and (9c) contain essentially the same information, namely T — 2 T. Clearly, 
T itself is a good variable for expansion in this case — giving y\ = 1. Equation (9b), just like 
(2b), gives y 2 = 1, and we obtain 


v = 1, A = 1, 


GO) 


whereby 


a = 1, fi = 0, y = 1, 8 = oo, rj = 1, (11) 

in complete agreement with the results for one-dimensional models with n > 2, as quoted 
in equation (13.3.20). 


14.4.C The Ising model in two dimensions 

The RG transformation in this case is given by equations (14.2.45) through (14.2.47), 
namely 

K' = - lncosh4kT, 

4 

L! = - lncosh47C, 

8 

and 

M' = - In cosh 4K — - lncosh2kT. (14) 

8 2 

It will be recalled that, while effecting this transformation, we started only with nearest- 
neighbor interactions (characterized by a single parameter K = /)/) but, due to the 


( 12 ) 

(13) 
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connectivity of the lattice, ended up with more — namely, the next-nearest-neighbor 
interactions (characterized by L') and the four-spin interactions (characterized by M') — 
in addition to the nearest-neighbor interactions (characterized by K'). On subsequent 
transformations, still higher-order interactions come into play and the problem becomes 
formidable unless some approximations are introduced. 

In one such approximation, due originally to Wilson (1975), we discard all interactions 
other than the ones represented by the parameters K and L, and at the same time assume 
K and L to be small enough so that equations (12) and (13) reduce to 

K' = 2K 2 , L' = K 2 . (15a, b) 

Now, if the parameter L had been introduced right in the beginning, the transformation 
equations, in this very approximation, would have been 

K’ = 2 K 2 + L, L' = K 2 . (16a, b) 


We shall treat equations (16) as if they were the exact transformation equations of the 
problem and see what they lead to. 

It is straightforward to see that the transformation (16) has a nontrivial fixed point at 


K* = 


1 

3’ 



Linearizing around this fixed point, we get 


Aq — — ki + k 2 , 



(17) 


(18) 


where k\ and lc 2 represent deviations of the parameters K and L from the fixed-point val¬ 
ues K* and L*, respectively. The transformation matrix of equation (14.3.12) is then 
given by 


A* — 

^v/2 “ 


whose eigenvalues are 


1 


/ 4 

3 

2 

\3 ° J 


1 


W = -(2 + V10), *2=-(2 —V10) 


and whose eigenvectors are 


0 i 


2 + V10 
2 


02 ' 


2 - V10 
2 


(19) 


(20a, b) 


(21a, b) 


The scaling fields u t are then determined by equation (14.3.13a) which, on inversion, gives 


Mi ~ {2 k\ + Cv/10 - 2 )k 2 }, u 2 ~ {2k\ - (vTO + 2)fc 2 |. 


(22a, b) 
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Clearly, the field u\, with W > 1, is the relevant variable of the problem, while the field 112 
is irrelevant. The “critical curve” in the (K,L )-plane is determined by the condition u\ — 0, 
while the linear part of this curve, in the vicinity of the fixed point (u — 0), is mapped by 
the relation (22a). In terms of the variables k\ and k-i, this segment of the critical curve is 
given by the equation 




V10 + 2, 

-/Cl, 

3 


(23) 


which represents a straight line of slope —1.7208; see Figure 14.6. 

To determine the physical critical point of this model, we have to locate on the criti¬ 
cal curve a point with L = 0, for the original problem had no interactions other than the 
one represented by the parameter K; the corresponding value of K would be our K c . 6 7 This 
requires a mapping of the critical curve right up to the /('-axis. While this has been done 
numerically (Wilson, 1975), a crude estimate of K c can be made by simply extending the 
straight-line segment (23) down to the desired limit. One thus obtains' 


K 11 3 _ 4 + yTO 

V 3 ~9 a/10 + 2 18 


= 0.3979, 


(24) 


which may be compared with the exact result found in Section 13.4, namely 0.4407. 



FIGURE 14.6 A section of the critical curve for the two-dimensional Ising model near the nontrivial fixed point 
( K * = i,L* = g). Points on the critical curve flow into the fixed point, while those off it flow away toward the 
trivial fixed point (K* = L* = 0) or ( K * = L* = oo). 

6 Remember that at each and every point on the critical surface — in this case, the critical curve — the correlation 
length is infinite; accordingly, each and every such point is qualified to be a critical point. The physical critical point is 
the one that is free of unnecessary parameters. 

7 The result obtained through numerical analysis was 0.3921. 
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Another quantity that can be estimated here is the critical exponent v. From equa¬ 
tions (14.3.21) and (14.3.20a),one obtains 


ini 

Info 


lnV2 

ln[(2 + v '10)/3] 


0.6385, 


(25) 


which may be compared with the exact value 1. Even though a comparison of the results 
obtained here with the ones following from exact analysis is not very flattering, the basic 
merits of the RG approach are quite obvious. 

One important aspect of critical phenomena, namely their universality over a large 
class of systems, is manifest even in this simple example. Imagine, for instance, that in 
the case of the given system a next-nearest-neighbor interaction Lq were indeed present. 
Our approximate treatment would then lead to the same fixed point and the same criti¬ 
cal curve as above, but our physical critical point would now be given by that “point on 
the critical curve whose L-value is Lq”; we may denote this critical point by K c {Lo). As for 
the critical behavior, it will still be determined by an expansion around the fixed point, 
for that is where the “relevant part” of the flow is; see again Figure 14.6. Clearly, the crit¬ 
ical behavior of the given system, insofar as exponents are concerned, will be the same, 
regardless of the actual value of Lq. And, by extension, the same will be true of any two 
systems which have the same basic topology but differ only in the details of the spin-spin 
interactions. 

As for the accuracy of the results obtained here, improvements are needed in several 
important respects. First of all, the exclusion of all interaction parameters other than K and 
L constitutes a rather inadequate approximation; one should at least include the four-spin 
interaction, represented by the parameter M, and may possibly ignore the ones that appear 
on successive transformations. Next, the assumption that the parameters K and L are small 
is also unjustified, especially for K\ this makes a numerical approach to the problem rather 
essential. Thirdly, we disregarded the renormalization of the spins, from the original a (r) 
to o'(?'), as required by equation (14.1.23); in the present problem, this would amount to 
introducing a factor of (^2) ?J / 2 , that is, 2 1 -'' 16 , for i\ here is |. In the actual treatment, one 
may have to introduce an unknown parameter, p, and determine its “true” value by theo¬ 
retical analysis (see Wilson, 1975). Highly sophisticated procedures have been developed 
over the years to accommodate (or circumvent) these problems, leading to very accurate — 
in fact, almost exact — results for the model under considerations. For details, see the 
review article by Niemeijer and van Leeuwen (1976), where references to other pertinent 
literature on the subject can also be found. 8 


14.4.D The e-expansion 

Application of the RG approach to systems in higher dimensions, namely with d > 2, 
presents serious mathematical difficulties. One is then forced to resort to approximation 
procedures such as the e-expansion, first introduced by Wilson (1972); see also Wilson 

8 In this reference one can also find a systematic method of constructing the scaling function f s (ui,U 2 ,...) from a 
knowledge of the regular function/CK))) of equation (14.3.4). 
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and Fisher (1972). This procedure was inspired by the observation that the field-theoretic 
calculations of the RG formulation become especially simple as the upper critical dimen¬ 
sion, d = 4, is approached; it, therefore, seemed desirable to introduce a parameter 
e(= 4 - d) and carry out expansions of the various quantities of interest around e = 0. The 
model adopted for these calculations was the same as the one referred to in Section 14.3, 
namely the continuous, n-vector spin model, with the probability distribution given by 
equation (14.3.31) . 9 An important advantage of using continuous spins <r(r){= o <ll> (r), 
p — 1 ,...,n}, with — oc < (7 ljl) < oo, is that one can introduce Fourier transforms a{q) 
and make use of the “momentum shell integration" technique of Wilson (1971). The 
parameters of interest now are (see Fisher, 1983) 


T-T 0 t 0 - T 2 a d 

r =-= u - u „ . 

ToRl Rq TX 


(26a, b) 


and, of course, the magnetic field parameter h; here, T 0 denotes the mean-field critical 
temperature qj/k (q being the coordination number), R 0 is a measure of the range of 
interactions, a is the lattice constant, whereas li is the real-space parameter appearing in 
equation (14.3.31). The transformation equations, with a scale factor l, turn out to be 


r' = Z 2 r + 4(Z 2 — l)c(n + 2)u — l 2 lnl(n + 2)(2n 2 ) l ru, (27a) 

u' = (l + slnZ)w — (« + 8)lnZ(27r 2 ) _1 u 2 , (27b) 


and 


h' = l 3 (l- ^elnl^jh, (27c) 

correct to the orders displayed; the parameter c in equation (27a) is related to a cutoff in 
the momentum space which, in turn, is a reflection of the underlying lattice structure. 
Transformation (27) has two fixed points — the so-called Gaussian fixed point, with 


r* = u* = h* = 0, 


(28) 


and a non-Gaussian fixed point, with 


8n 2 c(n + 2) 
(n + 8) 


2jt 2 

(n+ 8) S ’ 


h* = 0. 


We now examine two distinct situations. 


(29) 


9 It can be shown that, by a suitable transformation, the Ising model (n = 1), which is a discrete (rather than a con¬ 
tinuous) model, can also be rendered “continuous” with a probability distribution similar to (14.3.31). For details, 
see AppendixA in Fisher (1983). 
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Dimension d>4, so that e is a small negative number 

One readily sees from equation (27b) that the parameter u in this case decreases on trans¬ 
formation, so on successive transformations it will tend to zero. Clearly, only the Gaussian 
fixed point is the one appropriate to this case. Linearizing around this point, we obtain for 
the transformation matrix eA^ 


with eigenvalues 


/Z 2 4(Z 2 — l)c(n + 2)\ 

1 y 0 1 + e in l J 


X r = l 2 , X u = (1 + elnZ) < 1 


and, of course, 


It follows that 


x h = z 3 



y 1 = 2, y 2 



ys 


e, 


(30) 


(31) 


(32) 


(33) 


as in equation (14.3.33). Note that the parameter u in this case is an irrelevant variable 
but, as discussed at the end of Section 14.3, it is a dangerously irrelevant variable that does 
eventually affect the results of the calculation in hand. 


Dimension d< 4, so that e is a small positive number 

The parameter u now behaves very differently. If it is already zero, it stays so; otherwise, it 
moves away from that value, carrying the system to some other fixed point — possibly the 
non-Gaussian one, with coordinates given in (29) . The resulting pattern of flow in the ( r,u )- 
plane is shown in Figure 14.7; clearly, the Gaussian fixed point is no longer appropriate 
and the problem now revolves around the non-Gaussian fixed point instead. Linearizing 
around the latter, we obtain 


with eigenvalues 



1 - 


n + 2 
n + 8 
0 



4(Z 2 - l)c(re + 2)' 
1 — e In Z ) 



n + 2 
n+ 8 



and (1 — £lnZ) < 1. 


(34) 


(35) 


We note that of the “generalized coordinates” u\ and u 2 , which are certain linear combi¬ 
nations of the parameters Ar(= r — r*) and Au(— u— if), only u\ is a relevant variable of 




566 Chapter 14 • Phase Transitions: The Renormalization Group Approach 



FIGURE 14.7 A section of the critical curve and a sketch of the RG flows in the (r, id-plane for 0 < e <5C 1. Note that 
the critical curve is straight only to order s. 


the problem . 10 Identifying zq with the temperature parameter t, we obtain 

n + 2 

v f sa 2 -g. 

y n + 8 

Combining this with expression (33) for yj v namely 


h«3--£, 


we obtain, see equations (14.3.24), (14.3.27), (14.3.28), and (14.3.29), 


which gives 


1 n + 2 3 n— 1 

v ^ — “I - -g, A ^ — “I - -g, 

2 4(n + 8) 2 2(n + 8) 


4 — n 1 3 n + 2 

am -g, B m -g y m 1 -|-g, 

2(n + 8) 2 2(ra + 8) 2(« + 8) 

5 fa 3 + g, r] m 0, 


(36) 

(37) 

(38) 

(39) 

(40) 


correct to the first power in e. 

For obvious reasons the value of e of greatest interest to us is e = 1, for which the above 
results are totally inadequate; they do show the correct trends, though. For better numeri¬ 
cal accuracy it is essential to extend these calculations to higher orders in e. Considerable 

10 It can be seen quite easily that the generalized coordinate u 2 is directly proportional to A u, making u an irrelevant 
variable of the problem; see Problem 14.6, with fl 2 i = 0. 
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progress has been made in this direction, so that we now have expressions available that 
include terms up to e 3 and, in some cases, even e 4 ; for details, see Wallace (1976). One 
wonders if that degree of extension would be good enough for obtaining reliable results 
for s as large as 1. The answer is yes, and we will illustrate it with an example. 

For the spherical model we know exact values of the various critical exponents which 
may, for the purpose of illustration, be expressed as power series in s. Thus, for instance, 


Y = 


2 

d — 2 



, 1 1 2 1 3 1 4 

1 + 2' + i‘ V + Ii' + - 


(41) 


Since the radius of convergence of this series is 2, the value 1 of e is not really as large as 
it seems. In fact, the terms displayed in (41) already give, for e = 1, y = 1.9375, as opposed 
to the correct value 2. The situation is clearly encouraging and, with better methods of 
summing up diagrams, the convergence of the e-expansions can be improved greatly. In 
fact, some of the entries in Table 13.1 were originally obtained (or at least rechecked) with 
the help of this method. 

Before closing this subsection we would like to point out a somewhat unusual piece of 
information contained in the first-order results obtained above. This refers to the exponent 
a, for which we note the prediction that for large n it is negative and hence the (singular 
part of the) specific heat vanishes at T = T c (which we know to be the case with the spher¬ 
ical model) whereas for small n it is positive and hence the specific heat diverges (which 
we know to be the case with Ising-like systems). The inversion, from one case to the other, 
takes place at n = 4 where a, according to the first-order expression (39), vanishes. The 
inclusion of the second-order term in e upholds this prediction qualitatively but changes 
it quantitatively. We now have 


4 — n (n + 2) 2 (n + 28) 2 
2(72 + 8) S 4(22+ 8) 3 8 


(42) 


so that, with s = 1, the inversion takes place between n = 1 and n = 2 — in agreement with 
the more accurate results quoted in Section 13.7. 


14.4.E The 1/n expansion 

Another approach to the problem of determining critical exponents, as functions of d and 
n, is to adopt the limiting case n = oc as the starting point and carry out expansions in 
powers of the small quantity 1/n. Clearly, the leading terms in these expansions would 
pertain to the spherical model, which has been studied in Section 13.5, and the correction 
terms would enable us to get some useful information on models with finite n. We quote 
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some first-order results here: 11 



(43) 



(44) 


and 


a — — 


4-d\ 8(4 — l)S d 1 | Q 


4 — 2] 4 — d n 



(45) 


where 


sin{7r(d — 2)/2 }T(4 — 1) 
27r{T(d/2)} 2 


(2 < d < 4). 


(46) 


We note that the coefficients of expansion in this approach are functions of d just as the 
coefficients of expansion in the preceding approach were functions of n; in this sense, the 
two expansions are complementary to one another. Unfortunately, there has not been 
much progress in the evaluation of further terms of these expansions (except for the 
one mentioned in the note); accordingly, the usefulness of this approach has been rather 
limited. 

14.4.F Other topics 

As mentioned earlier, the renormalization group approach has provided a very clear expla¬ 
nation of the concept of universality, in that it arises when several physical systems, 
despite their microscopic structural differences, are governed by a common fixed point 
and hence display a common critical behavior. Typically, this behavior is linked to the 
dimensionality d of the physical space, the dimensionality n of the spin vector a and the 
range of the spin-spin interaction. Now, depending on the precise nature of the Hamil¬ 
tonian and the relative importance of the various parameters therein, it is quite possible 
that under certain circumstances the critical behavior of the system may “cross over’’ from 
being characteristic of one fixed point to being characteristic of another fixed point. For 
instance, we may write for the spin-spin interaction in the lattice 


= a ’P = l >•••>«• 


r,r' 


li 


In the special case d = 3, the expansion for r] is known to a higher order, namely 


(47) 
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If the given interaction is isotropic in the physical space but anisotropic in the spin com¬ 
ponents (assumed three in number), so that J a P = J a 8 a p, then the system is ordinarily 
supposed to be a Heisenberg ferromagnet; however, the anisotropy of the interaction may 
finally drive the system toward an Ising fixed point (if one of the /“ dominates over the 
other two) or toward an XY fixed point (if two of the J a are equally strong and dominate 
over the third one). In either case, we encounter what is generally referred to as a crossover 
phenomenon. 

Similarly, anisotropy in the physical space, J(R) = J{Rj)Sij or K(R)RjRj, may result in a 
crossover from a d-dimensional behavior to a d! -dimensional behavior (where d! < d). In 
the same vein, one may consider a long-range interaction, J(R) ~ R~ d ~ a Sij, leading to a 
critical behavior which, for a < 2, is quite seriously a -dependent; see, for instance, Prob¬ 
lem 13.22. However, as a goes over from the value 2— to 2+, the system crosses over to the 
universality class characterized by a short-range interaction and remains in that class for 
all a > 2. Crossover phenomena constitute a very fascinating topic in the subject of phase 
transitions but we cannot pursue them here any more; the interested reader may refer to 
an excellent review by Aharony (1976). 

Another topic of considerable interest deals with the so-called interfacial phase tran¬ 
sitions in both magnets and fluids. In his seminal paper of 1944, Onsager included in his 
model a row of “mismatched spins,” calculated the boundary tension (or what is more 
commonly referred to as the interfacial free energy) of this row and examined how this 
quantity vanished as T approached T c . In the case of a fluid system, this corresponds to 
the disappearance of the meniscus between the liquid and the vapor and hence to the 
vanishing of the conventional surface tension as T -»• T c —; see, in this connection, Prob¬ 
lem 12.27. A theoretical study of such interfacial layers involves consideration of the free 
energy of an inhomogeneous system, which has been a subject of considerable research 
for quite some time. We refer the interested reader to two review articles — by Abraham 
(1986) and by Jasnow (1986) — for further reading on this topic. 

A major ingredient employed by the RG approach is the fact that the critical behav¬ 
ior of a system is invariant under a scale transformation. It did not take very long to realize 
that an important connection exists between this transformation and the well-known con¬ 
formal transformation in a complex plane, for the latter too is, roughly speaking, a scale 
transformation in which the scale factor l varies continuously with position. Though, in 
principle, this connection could be relevant in all dimensions, the most fruitful applica¬ 
tions have been in the realm of two dimensions (where the conformal group consists of 
analytic functions of a complex variable). Among the important results emerging from the 
conformal transformation approach, one may mention the form of the many-point cor¬ 
relation functions, the critical behavior of finite-sized strips of different sizes and shapes, 
and the nature of the surface critical effects. For details, see the review article by Cardy 
(1987). 

Another area of interest pertains to the so-called multicritical points, for which refer¬ 
ence maybe made to Lawrie and Sarbach (1984) for theoretical studies and to Knobler and 
Scott (1984) for experimental results. 
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14.5 Finite-size scaling 

In our study of phase transitions so far, we generally worked in the thermodynamic limit, 
that is, we started with a lattice of size L\ x ... x L,/, containing N\ x ... x Nj spins (where 
Nj — Lj/a, a being the lattice constant), but at some appropriate stage of the calculation 
resorted to the limit Lj oo. This limiting process is crucial in some important respects; 
while it simplifies subsequent calculations, it also generates singularities which, as we 
know, are a hallmark of systems undergoing phase transitions. It is of considerable inter¬ 
est, both theoretically and experimentally, to find out what happens (or does not happen) 
if some of the Lj are allowed to stay finite. The resulting analysis is quite complicated, 
but considerable progress has been made in this direction during the last 25 years or so. 
Accordingly, a whole new subject entitled “finite-size scaling’’ has emerged, of which only 
a brief summary will be presented here. The reader interested in further details may refer 
to Barber (1983), Cardy (1988), and Privman (1990). 

To fix ideas, we start with a d-dimensional bulk system (“bulk” in the sense that it is infi¬ 
nite in all its dimensions) that undergoes a phase transition at a finite critical temperature 
T c (oo); clearly, the dimensionality d must be greater than the “lower critical dimension” 
d<. We also assume that d is less than the “upper critical dimension” d>, so that the critical 
exponents of the system are d-dependent and obey the hyperscaling relation 

dv = 2 — a = 2/3 + y. ( 1 ) 

We now consider a similar system that is infinite in only d' dimensions, where d! < d, 
and finite in the remaining dimensions; the geometry of this system may be expressed as 

pd-d' x ^d' 

, where L^> a and, for simplicity, is taken to be the same in all finite dimen¬ 
sions. We may expect this system to be critical at a finite temperature T C (L), not very 
different from T c (oo). In reality, this is so only if d' too is greater than d<; otherwise, the 
system continues to be regular at all finite temperatures and the criticality sets in only at 
T = 0. 12 The cases d' > d< and d' < d <t therefore, merit separate treatments. 

Our primary goal here is to determine the L-dependence of the various physical quan¬ 
tities pertaining to the system when the system is undergoing a phase transition. We attain 
this goal by setting up a finite-size scaling law that generalizes equation (12.10.7) or equa¬ 
tion (14.3.26) to systems with a finite L. Now, since the only relevant length in the region of 
a phase transition is the correlation length £ of the system, it is natural that we scale L with 
£ — leading to the combination 

(L/S;)~Lt v = (j} lv t) v . ( 2 ) 

At the same time, the combination ( h/t A ) appearing in the bulk scaling law may be 
written as 

(h/t A ) = (hL A/v )/(L 1/v t ) A . ( 3 ) 

12 For the special case d' = 0, when the system is fully finite, this point has already been emphasized in Section 12.1. 
Here we assert that, even when some of the system dimensions are infinite (and hence the total number of spins is 
infinite), a finite-temperature singularity does not arise unless the number of those infinite dimensions exceeds rf<. 
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The appropriate combinations of L with t and h, therefore, are L ]/Iv t and L A/|; h, respec¬ 
tively. The “singular” part of the free energy density of the system may then be written in 
the form (see Privman and Fisher, 1984) 

h (s) 

f s Ht,h-L)^ vkT ^L- d Y(M,x 2 ), (4) 

where xi and X 2 are the scaled variables of the system, namely 

Xi = CiLV v t, x 2 = C 2 L A/v h, (5a, b) 

with 

T — T c (o o) , nB , 

t= , , h = - \t\,h<^\, (6a, b) 

T c (oo) kT 

while Ci and C 2 are certain nonuniversal scale factors peculiar to the system under study. 
Expressed in terms of the variables xi and x 2 , the function Y is expected to be a univer¬ 
sal function — common to all systems in the same universality class as the system under 
study. Of course, the definition of the universality class will now include (apart from the 
conventional parameters d, n, and the range of the spin-spin interaction) the parameter 
d' as well as the nature of the boundary conditions imposed on the system (which, unless 
stated otherwise, will be assumed to be periodic ). 

We note that, in the limit L -»• oo, expression (4) reduces to equation (12.10.7), provided 
that the function Y has the asymptotic form 

Y{Xi,X 2 ) « \Xi\ dv f±(X 2 /\Xi\ A ) \X\ |,x 2 » 1, (7) 

thus identifying the nonuniversal parameters F and G with Cf v and C 2 /C A , respectively. 
This enables us to write Ci and C 2 in terms of F and G, namely 

Ci ~ f 1/(2 -“\ c 2 ~F (fi+Y)/i 2 - a) G, (8a, b) 


which provides a means of determining the nonuniversal parameters Ci and C 2 from a 
knowledge of the bulk parameters F and G; any other factors appearing in (8) would be 
universal. Once Ci and C 2 are known, no more nonuniversal amplitudes are needed to 
describe the behavior of the system — regardless of whether it is finite-sized or infinite in 
extent. We are now in a position to examine the consequences of the scaling law (4) . 

With appropriate differentiations, we obtain from equation (4) the following expres¬ 
sions for the zero-field susceptibility per unit volume and the “singular” part of the 
zero-field specific heat per unit volume: 


if a 2 W s) \ 
XOit, I - ~y[ 2 l 

\ / B =0 


kTu 2 C 2 L 2A i v ~ d (d 2 Y(x i,x 2 ) 


(kT) 2 


3x| 


* 2=0 


ij. 2 c 2 Ly' v 

kT 


Tv(xi), 


(9) 
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and 


4 S, ( 


T / d 2 A W \ 

~V \ dT 2 j 

\ / B= 0 


kT 2 C 2 L c ‘' v 
7?(oo) 


Y c (x\), 


kT 2 C 2 L 2 / v ~ d 

T 2 (oo) 


d 2 Y(x i,x 2 ) 
dx ? 


X2=0 


(to) 


where F x (xi) and F c Ui) are appropriate derivatives of the universal function F(x i,X 2 ) 
and, hence, are themselves universal. We may, for further analysis, supplement the above 
results with the corresponding ones for the correlation length of the finite-sized system, 
namely 


t;(t,Iv,L ) = LS(X l,X2) 


( 11 ) 


and 


t;o(t-,L)=LS(x 1 ), (12) 

where S(xi) = S(xi,0); note that Lhe functions S(X],Xz) andS(xi) are also universal. We shall 
now focus our attention on equations (9), (10), and (12), to see what messages they deliver 
in different regimes of the variables T and L. 


Case A: T > T c ( 00 ) 

With t > 0 3.11(1 L > q, } the vsrisble x\ in this regime would, be positive 3nd much gre3ter 
than unity. The functions Y X ,Y C , and S are then expected to assume the form 

Y x (xi) & rx~^ y , Yc(xi) » Ax^ a , S(xi)^Nx^ v , (13a, b, c) 

so that we recover the standard bulk results 


Xo 


m 2 f crcf 

kT c (oo) 


Js) 


kAC 2 ~ a t~ a , ?o^JVCf 1 'r u , 


(14a, b, c) 


complete with nonuniversal amplitudes and universal factors. The effect of I in this regime 
appears only as a correction to the bulk results; under periodic boundary conditions, such 
correction terms turn out to be exponentially small, that is, 0(e~ L ^ 0 ) where ~ aP 


Case B: T ~ T c (oc) 

This case refers to the “core region” where |xi| is of order unity and hence \t\ is of order 
I -1 /”; the bulk critical point, t — 0, is at the heart of this region. Equations (9), (10), and 
(12) now yield the first significant results of finite-size scaling, namely 


pC 2 

kT c (oo) 


Xo~i^7±-P y/V ’ c o ~ kC 2 L a i v , ? 0 -L. 


3 See, for instance, Luck (1985), and Singh and Pathria (1985b, 1987a). 


(15a, b, c) 
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Case C: T < T c (oo) 

Here we must distinguish between the cases d' > d< and d' < d<. In the first case, the sys¬ 
tem becomes critical at a temperature T C (L) that is not too far removed from T c (oo); in the 
second, the system remains regular at all finite temperatures and becomes critical only at 

r = o. 


(i) d' > d< 

In view of the fact that the system is now singular at T = T C (L) rather than at T c ( oo), it seems 
natural to define a shifted temperature variable t such that 


■ T-T c (L) _ 

T c (o o) 

compare this with (6a) . Thus, for any temperature T, 

t=t — z; x = [T C {L) — T c (oo)]/T c (oo), 
which prompts us to define a shifted scaled variable 


(16) 


(17) 


X\ = C\lfl v i = X\ — X; X=C 1 L 1 /V x. 


(18) 


Clearly, the scaling functions governing the system would now be singular at ±i = 0, that 
is, at x\ = X. With no other arguments present, we presume that X will be of order unity; 
the shift in T c is thus given by 


|T| = |X|Cf 1 lr 1/u = 0(L" 1 / V ). (19) 

Now, as T -> T C (L), the correlation length of the system approaches infinity — with 
the result that, insofar as the qualitative nature of the critical behavior is concerned, the 
finite variable L, however large, becomes essentially unimportant. The behavior of the 
system, in the immediate neighborhood of T C (L), would, therefore, be characteristic of a 
d'-dimensional bulk system rather than of a d-dimensional one; accordingly, it would be 
governed by the critical exponents pertaining to d' dimensions rather than by the 

exponents a,/3,... pertaining to d dimensions. We, therefore, expect that, as x,\ -»■ 0, the 
functions Y x , Y c , and S of equations (9), (10), and (12) assume the form 

Y X (X]) ~ rx-\ Y c (x\) ^ Aq“, S(xi) ^ Nx^ v , (20a, b, c) 


Xo ~ {u 2 /kT c (oo)]rC~ i 'C%L(r-rh v i-r, 


with the result that 


(21a) 

(21b) 
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and 


£o « IVCf i L ( - v - i)/v r i . (21c) 

It is obvious that, for t < 0 but such that 1 1\ <$: 1, the same results would hold — except that 
t would be replaced by 1 1\. 

The contents of equations (21), insofar as the dependence on L and t is concerned, 
have been verified by direct calculation on a variety of systems over the years; for details, 
see the review articles by Barber (1983) and Privman (1990) cited earlier. More recently, 
Allen and Pathria (1989) have verified the nonuniversal amplitudes as well by carrying out 
an explicit calculation for the spherical model (n = oo) in the general geometry L d ~ d ' x 
oo d ' , with both d and d! greater than d < . Remarkably enough, they found that, just as the 
critical exponents a, ft,... are the same functions of d' as the exponents «,/!,... are of d, the 
universal coefficients f, A,... too are the same functions of d' as the coefficients T, A,... are 
of d; the same is true of the coefficients appearing in the presence of a magnetic field (see 
Allen and Pathria, 1991). One wonders if this would also be the case for general n\ 

(ii) d' < d< 

In this case the singularity of the problem lies at T = 0, with the result that at all finite 
temperatures the system is regular and hence expressible by smooth, analytic functions. 
We may, therefore, generalize the scaling law (4) to apply at all temperatures down to T = 0 
by simply allowing the scale factors Ci and C 2 to become T -dependent and writing (after 
Singh and Pathria, 1985b, 1986a) 


x 1 = Ci(T)L 1/v t, x 2 = C 2 (T)L A/v h, (22a, b) 

leaving t and h unchanged; the quantities Ci and C 2 must be such that, as T -> T c ( 00 ) 
from below, they approach the quantities Cj and C 2 of equations (5). Expressions (9) and 
(10) now take the form 


idCluC 

Xo (t;L) = — Y x (x 1 ) 


(23) 


and 


d 0 s> (t]L ) = kT 2 



i“ /v T c (x 1), 


(24) 


respectively, while expression (12) remains formally the same. 

Now, as we approach the critical temperature T c (which is zero here), we again expect 
the quantities xo, , and £0 to behave in a manner characteristic of a d' - dimensional bulk 
system. So, let us assume that, in limit d -> 0, our scale factors behave as 
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and our universal functions behave as 

Y x (x\) ~ \x\\ B , Y c (xi) ~ |xi| 0 , S(jei)~|jci| ff (xi-^-oo). (26a, b, c) 

The resulting T- dependence of xo>Cq \ and §o then is 

xo -r 2s “ 1+0r , Cq S) - T (2+(p)r , £ 0 ~T aT . (27a, b,c) 

The corresponding results for an «-vector, d'-dimensional model (with n > 2 and d' < d <t 
where d< = 2) are 14 


Xo ~T~ 2/(2-rf') > c (s) _ T df/&-df) i £ 0 _ r -l/(2-d')_ 


Comparing (27) with (28), we infer that 


0 = 


1 -2s- 


2 — d' 


4 >~ 


a 


r( 2 - d') 




-1 


r(2 - d') ‘ 


Very shortly we shall find, see equations (44), that 


(28a, b, c) 


(29a, b, c) 


r = — l/v(d — 2), s = y6/v(d-2), 


with the results 


0 = 2/3 + 


vd'(d - 2) 


vd'(d- 2) v(d — 2) 

0 =-...--2, cr = 


(2 - d') ’ ^ ~ (2 - d') “ (2 - d') 

The L-dependence of the various quantities now turns out to be 

Xo ~ Z,V+0)/u ~ L 2i.d-d')/(2-d’) 


c ( s) — Lp+ft/v ~ jj-2(d-d')/(2-d') 


(30a, b) 


(31a, b, c) 


(32a) 

(32b) 


and 


§0 ~ I 1+CT/y ~ l}d-d')t(2-d') _ ( 32c ) 

It is remarkable that in these last expressions the critical exponents pertaining to d dimen¬ 
sions have disappeared altogether and the resulting powers of L depend entirely on the 
geometry of the system! Expression (32a) agrees with the earlier results for xo pertaining to 
a “block” geometry (d! — 0) and to a “cylindrical” geometry {d! = 1), namely 


I L d for d' = 0 (33a) 

Xo ~ { 

[ l 2 ‘ d -b f or d' = l; (33b) 

14 For the spherical model (n = oo), these results appear in Section 13.5; see equations (13.5.34), (13.5.35), and 
(13.5.64). Since the criticality in this case occurs at absolute zero, these results hold for all models with continuous 
symmetry, that is, with n > 2. See, for instance, Section 13.3, where n is general but d' = 1. 
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see Fisher and Privman (1985). For d' = 2, the L-dependence of the various quantities 
studied here becomes exponential instead of a power law. 

To obtain results valid for all T in the range 0 < T < T c (o o), we need to know the full 
T -dependence of the scale factors Ci and C 2 . It turns out that this too can be determined 
from the properties of the corresponding bulk system — in particular, from the field-free 
bulk correlation function G(R, T), which is known to possess the following forms: 

G(R, T) ~ 7T (d - 2+ ' ;) T = 77 ( 00 ) (34) 

and 

G(R,T) = ml(T)+A(T)R- (d ~ 2) T < T c (oo), (35) 

where mo(T) is the order parameter of the bulk system and A(T) another system- 
dependent parameter . 15 Now, the correlation function of the finite-sized system may be 
written in the scaled form 


G(R,t,h;L) ^ b(T)R- {d - 2+,1) Z(Xi,X2,x 3 ), (36) 

where the scaled variables X\ and X 2 are the same given by equations (22a, b) while 
X 3 =R/L ; as usual, the scale factor D(T) is nonuniversal while the function Z(x i,JC 2 ,* 3 ) 
is universal. 

Expression (36) already conforms to (34), with x\ = X 2 = X 3 = 0. For conformity with 
(35), the function Z must possess the following asymptotic form: 

Z(Xi,X2,X3)^Zi(\Xi\ v X 3 ) d ~ 2+r >+Z 2 {\Xi\ v X3) rl X\ > oo, x 2 = 0, x 3 > 0, (37) 


with 


It follows that 


Zi = 


_'Wq(T)_ 

£>(7’)[Ci(r)|r|]”( d - 2 + , j)’ 




A(T) 


DmicdTm'”’ 


Ci(T)\t\ 


m 2 (T) 

A{T) 


\/v(d.-2) 


D(T) 


A?(T) 

m 0 \T) 


2/v(d—2) 


here, use has been made of the fact that 


(38a, b) 


(39a, b) 


v(d — 2 + ri) = (2 — a) — y = 2/3. (40) 

We shall now establish a relationship between the scale factors C 2 and D. For 
this, we utilize the fluctuation-susceptibility relation ( 12 . 11 . 12 ) which, with the help of 

15 Note that the exponent rj appears only in equation (34) and not in (35); for details, see Schultz et al. (1964) and 
Fisher et al. (1973). 
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expression (36), gives for the zero-field susceptibility per unit volume 


v 2 D(T) f Z(x lt 0,R/D ddR 


M' D(T) r Z(x i, 
X0 ’ ” a 2d kT J R d 


'- 2 +i; 


= m2 ^ (T) L 2 -%(x 1 ), 
a 2d fcr * 1 

where Z x is another universal function. Comparing (41) with (23), we get 


(41) 


D(T) ~ a 2d Cf (T). 


(42) 


Equation (39b) now gives 


C 2 (T) ~ fl“ d [A^(r)/mo' ? (7’)] 1/ ^ d - 2 >. (43) 

Equations (39) and (43) give us the full T- dependence of the scale factors Ci, C 2 , and D for 
all T in the range 0 < T < T c (o o); these equations were first derived by Singh and Pathria 
(1987a). 

Before utilizing these results we note that since, in the limit T -»• 0, nio(T) approaches 
a constant value while A(T) ~ T, expressions (39a) and (43) yield 

Cl|f| ~ T~ 1/v< - d - 2 \ C 2 ~T fi/v< - d - 2) l (44a, b) 


exactly as stipulated in equations (25) and (30). As for the T -dependence of the quantities 
Xo. and |o, we observe that, regardless of whether we keep L fixed and let T -> 0 or keep 
T fixed and let L -+ oo, in either case jq -»■ —oc; the asymptotic forms (26) of the universal 
functions Y x , Y c , and S, therefore, apply throughout the region under study. Now, with 9, <j>, 
and a given by equations (31), our final results for /o> Cq S) , and fo turn out to be 


Xo' 


.6) . 


h 2 A(T) 

a 2d kT 


m 2 (T) 

A(T) 


-.2/(2 -d') 


! T 9 

m 2 (T) 

r 

m 2 {T) 

y dr 

. A{T) . 

t 

. A(T) . 


jj>(d-d')/(2-d') 


—(4—d')/(2—d') 


r— 2 (d—d')/( 2 —d') 


(45) 


(46) 


and 


1/(2 -d’) 

jJ.d-d')/(2-d') * 

complete with nonuniversal amplitudes. Comparing (45) with (47), we find that, in the 
regime under study, 


So' 


m 2 (T) 

A(T) 


Xo/?o ~ R 2 A{T)/a 2d kT, 


(48) 
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a function of T only. In the case of the spherical model, since A(T) is proportional to T at 
all T < T c (o o), see equation (13.5.71), the quantity xo/§o * s a constant — independent of 
both T andi; see also equation (13.5.64). 

It is important to note that the above formulation ties very neatly with the one provided 
by the scale factors Ci and C 2 of equations (5) that covered cases A and B pertaining to the 
regions T > T c (oo) and T ~ T c (o o), respectively. To see this, we observe that, as T -> T c (oo) 
from below, mo(T) becomes ~ \t\P andA(T) ~ \t\ vtl ] expressions (39a) and (43) then assume 
the form 

Ci(T)\t\ ~ |f|(20-vi;)/v(d-2) _ | f |l (49a) 


and 


Ca(D ~ |f|°. (49b) 

Clearly, Ci(T) and GAT) now assume some constant values that may be identified with 
Ci and C 2 — thus providing a unified formulation through the same universal func¬ 
tions Y(xi,x 2 ), S(x i,X 2 ), and Z(xi,X 2 ,xf) covering the regions of both first-order and 
second-order phase transitions! Remarkable though it is, this finding is not really surpris¬ 
ing because, with L finite and d' < d <t the system is critical only at T = 0 and analytic 
everywhere else; so, its properties should indeed be expressible by a single set of func¬ 
tions throughout. Of course, as L -»• 00 , the criticality spreads all the way from T = 0 to 
T = T c ( 00 ). 

As regards the spin dimensionality n, our results for cases A, B, and C(i) were quite gen¬ 
eral; only in case C(ii) did we specialize to systems with continuous symmetry (n > 2). With 
a slight modification, the case of discrete symmetry (n = 1) can also be taken care of. The 
net result essentially is the replacement of the number 2 in equations (28) and henceforth 
by the “lower critical dimension,” d <t of the system — leading to results such as 16 

cf'-L-t, & T<T C { 00 ), (50a, b, c) 

with ^ = d < (d - d')/(d < - d'); compared with equations (32). Once again, the In¬ 
dependence of the various quantities of interest follows a power law, which changes to 
an exponential when d’ = d<; in the case of scalar models, this happens at d! = 1. 

Throughout this discussion we have assumed that the total dimensionality d of the 
system is less than the “upper critical dimension” d>. The case d > c/> presents some 
special problems but the net result is that, while the situation in the region T < T c (o o) 
is described by the same set of expressions as above, in the region T ~ T c (o o) it is con¬ 
siderably modified. For instance, one now gets in the region T — T c { 00 ), for d> d > and 
d d <, 


Xo ~ I f(d-d')/(d > -d') ^ 
16 See, for instance, Singh and Pathria (1986b). 


C M L< rf - d 'V(d>-d') 


(51a, b, c) 
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which may be compared with the corresponding results, (15a, b, c), for d < d>. Further¬ 
more, if d = d > and/or d' = d <} factors containing In! appear along with the power laws 
displayed in (51). For details, see Singh and Pathria (1986b, 1988, 1992). 

Finally we would like to emphasize the fact that finite-size effects in any given system 
are quite sensitive to the choice of the boundary conditions imposed on the system. For 
simplicity, we assumed the boundary conditions to be periodic. In real situations, there 
may be reasons to adopt different boundary conditions such as antiperiodic, free, and 
so on. This, in general, changes the mathematical character of the finite-size effects and 
the finite-size corrections in not only the singular part(s) of the various quantities stud¬ 
ied but in their regular part(s) as well. For comparison between theory and experiment, 
this aspect of the problem is of vital importance and deserves a close scrutiny. For lack of 
space we cannot go into this matter any further here; the interested reader may refer to a 
review article by Privman (1990), where other references on this topic can also be found. 
An allied subject in this regard is the “critical behavior of surfaces,” for which reference 
maybe made to Binder (1983) and Diehl (1986). 


Problems 


14.1. Show that the decimation transformation of a one-dimensional Ising model, with l = 2, can be 
written in terms of the transfer matrix P as 


P’{K'} =P 2 {K], 


( 1 ) 


where K and K' are the coupling constants of the original and the decimated lattice, respectively. 
Next show that, with P given by 



see equation (13.2.4), relation (1) leads to the same transformation equations among K and K' as 
(14.2.8a, b, and c). 

14.2. Verify that expression (15) of Section 14.2 indeed satisfies the functional equation (14) for the 

field-free Ising model in one dimension. Next show (or at least verify) that, with the field present, 
the functional equation (11), with K' given by (8) , is satisfied by the more general expression 


f(K\ , K2) = — In coshX 2 + J e 2K ' + e ZKl sinh 2 K2 J 


14.3. Verify that expression (32) of Section 14.2 indeed satisfies the functional equation (31) for the 
field-free spherical model in one dimension. Next show (or at least verify) that, with the field 
present, the functional equation (27), with K' given by (25), is satisfied by the more general 
expression 



where A is determined by the constraint equation 
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14.4. Consider the field-free spherical model in one dimension whose partition function is given by 
equation (14.2.24) as well as by (14.2.19), withk^ = K 2 = 0. Substituting aj = (2A/1C ) 1/2 s'- in the 
former and comparing the resulting expression with the latter, show that 

/2 tt \ n ' /2 

Qn(K 1} A) = ^—J Q n 4Ki, A"), 

where N' = | N and A" = (2A 2 /K\) — K\. This leads to the functional relation 

/(*:„«—i m(|)+ !/(*,.*"). 


Check that expression (14.2.32) satisfies this relation. 

14.5. An approximate way of implementing an RG transformation on a square lattice is provided by the 
so-called Migdal-Kadanoff transformation 17 shown in Figure 14.8. It consists of two essential 
steps: 

(i) First, one-half of the bonds in the lattice are simply removed, so as to change the length scale 
of the lattice by a factor of 2; to compensate for this, the coupling strength of the remaining 
bonds is changed from/ to 2J. This takes us from Figure 14.8(a) to Figure 14.8(b). 

(ii) Next, the sites marked by crosses in Figure 14.8(b) are eliminated by a set of one-dimensional 
decimation transformations, leading to Figure 14.8(c) with coupling strength/'. 

(a) Show that the recursion relation for a spin- \ Ising model on a square lattice, according to 
the above transformation, is 


x' = 2x 2 /(l + x 4 ), 


where x = exp(—2 K) and x! = exp(-2k'). Disregarding the trivial fixed points x* = 0 and 
x* = 1, show that a nontrivial fixed point of this transformation is 


x 


* 


— 1 +2 > /2sinh 


- sinfT 1 
3 


17 

V2 


0.5437; 


compare this with the actual value of x c , which is (^/2 - 1) ~ 0.4142. 

(b) Linearizing around this nontrivial fixed point, show that the eigenvalue A of this 
transformation is 


A = 2(1 — x*)/x* ~ 1.6785 

and hence the exponent v = ln2/lnA ~ 1.338; compare this with the actual value of v, 
which is 1. 



J J 2 J 2 J j- 



J' 

J 









































(a) (b) (c) 



FIGURE 14.8 Migdal-Kadanoff transformation on a square lattice. 


17 See Kadanoff (1976a). 
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14.6. Consider the linearized RG transformation (14.3.12), with 


f flu £*12\ 
\£*21 £* 22 / 


(3) 


such that (flnfl 22 — £*i 2 £* 2 i) / 0. We now introduce the “generalized coordinates” u\ and u 2 through 
equations (14.3.13); clearly, u\ and uz are certain linear combinations of the system parameters k\ 
and kz. 

(a) Show that the slopes of the lines U\ = 0 and 112 = 0, in the (fci, fc 2 )-plane, are 

£*21 X2 — an £*21 A-i — tin 

m 1 =-=- and m2 = -=-, 

X 2 — 0.22 £*12 M — £*22 £*12 

respectively; here ,'/.] and X 2 are the eigenvalues of the matrix Verify that the product 
m\m 2 = — £* 2 i/£*i 2 and hence the two lines are mutually perpendicular if and only if «i 2 = ** 21 - 

(b) Check that, in the special case when a\ 2 = 0 but az\ / 0, the above slopes assume the simple 
form 


m\ = 00 and m 2 = £* 2 i/(£*n — £* 22 ) 
whereas, in the special case when a 2 i = 0 but a\ 2 ± 0, they become 

m\ = (fl 22 - £*u) /£*12 and m 2 = 0; 
note that Figure 14.7 pertains to the latter case. 

(c) Examine as well the cases for which either flu or a 22 is zero; Figure 14.6 pertains to the latter 
of these cases. 

14.7. Check that the critical exponents (14.4.38) through (14.4.40), in the limit n -> 00 , agree with the 
corresponding exponents for the spherical model of Section 13.5 with d < 4. 

14.8. Show, from equations (14.4.43) through (14.4.46), that for d < 4 



where e = (4 — d) « 1. Check that these results agree with the ones following from equations 
(14.4.38) through (14.4.40) for n » 1. 

14.9. Using the various scaling relations, derive from equations (14.4.43) through (14.4.45) comparable 
expressions for the remaining exponents p, S, and v. Repeat for these exponents the exercise 
suggested in the preceding problem. 







Fluctuations and Nonequilibrium 

Statistical Mechanics 


In this course we have been mostly concerned with the evaluation of statistical averages 
of the various physical quantities; these averages represent, with a high degree of accu¬ 
racy, the results expected from relevant measurements on the given system in equilibrium. 
Nevertheless, there do occur deviations from, or fluctuations about, these mean values. 
Even though they are generally small, their study is of great physical interest for several 
reasons. 

First, such a study enables us to develop a mathematical scheme with the help of 
which the magnitude of the relevant fluctuations, under a variety of physical situations, 
can be estimated. Not surprisingly, we find that while in a single-phase system the fluctu¬ 
ations are thermodynamically negligible but they can assume considerable importance in 
multiphase systems, especially in the neighborhood of a critical point. In the latter case, 
we obtain a rather high degree of spatial correlation among the molecules of the system 
which, in turn, gives rise to phenomena such as critical opalescence. 

Second, it provides a natural framework for understanding a class of phenomena that 
come under the heading “Brownian motion”; these phenomena relate properties such as 
the mobility of a fluid system, its coefficient of diffusion, and so on, with temperature 
through the so-called Einstein relations. The mechanism of Brownian motion is vital in for¬ 
mulating, and in a certain sense answering, questions as to how “a given physical system, 
which is not in a state of equilibrium, finally approaches such a state” while “a physical 
system, which is already in a state of equilibrium, persists to stay in that state.” 

Third, the study of fluctuations, as a function of time, leads to the concept of certain 
“correlation functions” that play a vital role in relating the dissipative properties of a sys¬ 
tem, such as the viscous resistance of a fluid or the electrical resistance of a conductor, 
with the microscopic properties of the system in a state of equilibrium; this relation¬ 
ship (between irreversible processes on one hand and equilibrium properties on the 
other) manifests itself in the so-called fluctuation-dissipation theorem. At the same time, a 
study of the “frequency spectrum” of fluctuations, which is related to the time-dependent 
correlation function through the fundamental theorem of Wiener and Khintchine, is of 
considerable value in assessing the “noise” met with in electrical circuits as well as in the 
transmission of electromagnetic signals. 


Statistical Mechanics. DOI: 10.1016/B978-0-12-382188-1.00015-3 
© 2011 Elsevier Ltd. All rights reserved. 
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15.1 Equilibrium thermodynamic fluctuations 

We begin by deriving a probability distribution law for the fluctuations of certain basic 
thermodynamic quantities pertaining to a given physical system; the mean square fluctu¬ 
ations can then be evaluated, in a straightforward manner, with the help of this law. We 
assume that the given system, which may be referred to as 1, is embedded in a reservoir, 
which may be referred to as 2, such that a mutual exchange of energy, and of volume, can 
take place between the two; of course, the overall energy E and the overall volume V are 
supposed to be fixed. For convenience, we do not envisage an exchange of particles here, 
so the numbers N\ and N 2 remain individually constant. The equilibrium division of E into 
E\ and E 2t and of V into Vi and V 2> must be such that parts 1 and 2 of the composite sys¬ 
tem (1 + 2) have a common temperature T* and a common pressure P*; see Sections 1.2 
and 1.3, especially equations (1.3.6). Of course, the entropy of the composite system will 
have its largest value in the equilibrium state; in any other state, such as the one character¬ 
ized by a fluctuation, it must have a lower value. If AS denotes the deviation in the entropy 
of the composite system from its equilibrium value So, then 


AS = S — S 0 = klnClf — fclnfio. (1) 

where Qf (or Qq) denotes the number of distinct microstates of the system (1 + 2) in the 
presence (or in the absence) of a fluctuation from the equilibrium state; see equation 
(1.2.6). The probability that the proposed fluctuation may indeed occur is then given by 

p oc Qf oc exp(AS/fc); (2) 

see Section 3.1, especially equation (3.1.3). In terms of other thermodynamic quantities, 
we may write 


/ 

/ dE 2 -\-P 2 dV 2 

AS = ASi + AS 2 = ASi + J ^ ; (3) 

o 

note that the pressure P 2 and the temperature T 2 of the reservoir may, in principle, vary 
during the build-up of the fluctuation! Now, even if the fluctuation is sizable from the point 
of view of system 1, it will be small from the point of view of 2. The “variables” P 2 and T 2 
may, therefore, be replaced by the constants P* and T*, respectively; at the same time, the 
increments dE 2 and dV 2 may be replaced by —dE\ and —dV i, respectively. Equation (3) 
then becomes 


AS = ASi — (AE\ +P* AV\)/T*. (4) 

Accordingly, formula (2) takes the form 


poc exp{-(A£i - T*ASi +P*AVi)/kT*}. 


(5) 
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Clearly, the probability distribution law (5) does not depend, in any manner, on the 
peculiarities of the reservoir in which the given system was supposedly embedded. For¬ 
mula (5), therefore, applies equally well to a system that attained equilibrium in a statistical 
ensemble (or, for that matter, to any macroscopic part of a given system itself). Conse¬ 
quently, we may drop the suffix 1 from the symbols A E\, ASi, and A Vi. and the star from 
the symbols P* and T*, and write 

p<xexp{—(AE—TAS + PAV)/kT}. (6) 

In most cases, the fluctuations are exceedingly small in magnitude; the quantity A E 
may, therefore, be expanded as a Taylor series about the equilibrium value ( AE) 0 = 0, with 
the result 


a£ -(I) 0 as+ 


dE\ 
3VJ o' 


AV 


1 


d 2 E 
dS 2 


(AS) 2 + 2 


d 2 E 


dSdV , ASAV+ {sV- 2 (Al ° 
0 \ / 0 


d 2 E 


Substituting (7) into (6) and retaining terms up to second order only, we obtain 

p oc exp {—{ATAS — APAV)/2kT}] 
here, use has been made of the relations 

= T, 


B) =-P. 
dVj o 


and of the fact that the expression within the square brackets in (7) is equivalent to 


dE 


as 


dE 


av 


A — AS+A — AV= AT AS-APAV. 


(7) 


( 8 ) 


0) 


( 10 ) 


With the help of (8), the mean square fluctuations of various physical quantities and the 
statistical correlations among different fluctuations can be readily calculated. We note, 
however, that of the four A terms appearing in this formula only two can be chosen inde¬ 
pendently; the other two must assume the role of “derived quantities.” For instance, if we 
choose AT and A V to be the independent variables, then AS and A P can be written as 


AS = 


and 


3S\ 

df), 


dP 


AT + 


as \ c v 

— A V = ——AT + 
dV) T T 


d P 


| AV 

dTj v 


ar 


AP =( — ) AT + —— AV= — 


dP 


av 


dP 


dT 


AT-—AV, 

V K T V 


(ID 


( 12 ) 
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kt being the isothermal compressibility of the system. Substituting (11) and (12) into (8), 
we get 


p oc exp 


Cy 

2kT 2 


(AT) 2 — 


1 

2TcTktV 



(13) 


which shows that the fluctuations in T and V are statistically independent, Gaussian 
variables'. A quick glance at (13) yields the results 


-, kT 2 

(AT) 2 =—, 

Cy 


(AV) 2 = kTicrV, 


(14a) 


while 


(ATAV) = 0. (14b) 

Similarly, if we choose AS and A P as our independent variables, we are led to the 
distribution law 


p oc exp 


1 

2 kCp 


(AS) 2 - 


k S V 

2kT 



(15) 


which gives 


(AS) 2 = kCp, 


(AP) 2 = 


kT 

^V’ 


(16a) 


while 


(ASAP) = 0; (16b) 

here, ks denotes the adiabatic compressibility of the system. 

We note that, in general, the mean square fluctuation of an extensive quantity is directly 
proportional to the size of the system while that of an intensive quantity is inversely pro¬ 
portional to the same; in either case, the relative, root-mean-square fluctuation of any 
quantity is inversely proportional to the square root of the size of the system. Thus, except 
for situations such as the ones met with in a critical region, normal fluctuations are ther¬ 
modynamically negligible. This does not mean that fluctuations are altogether irrelevant 
to the physical phenomena taking place in the system; in fact, as will be seen in the sequel, 
the very presence of fluctuations at the microscopic level is of fundamental importance to 
several properties of the system displayed at the macroscopic level! 

With the help of the foregoing results, we may evaluate the mean square fluctuation in 
the energy of the system. With T and V as independent variables, we have 



AT + 
v 



Ah. 


A E = 


T 


(17) 
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Squaring this expression and taking averages, keeping in mind equations (14), we get 


(AE) 2 = kT 2 C v + kTK T V 



= kT 2 Cy + kTi<T 




(18) 


Now, the results derived in the preceding paragraphs determine the fluctuations of the 
various physical quantities pertaining to any macroscopic subsystem of a given system, 
provided that the number of particles in the subsystem remains fixed. The expression (14b) 
for (A V) 2 may, therefore, be used to derive an expression for the mean square fluctuation 
of the variable v (the volume per particle) and the variable n (the particle density) of the 
subsystem. We readily obtain 

(Ac) 2 = kTK T V/N 2 , (An) 2 = \(Av) 2 = kTK T N 2 /V 3 ; (19) 

ir 

note that the last result obtained here is in complete agreement with equation (4.5.7), 
which was derived on the basis of the grand canonical ensemble. A little reflection shows 
that this result applies equally well to a subsystem with a fixed volume V and a fluctuating 
number of particles N. The mean square fluctuation in N is then given by 


(AN) 2 = V 2 (A n) 2 = kT KT N 2 /V. 


( 20 ) 


Substituting (20) into (18), we obtain once again the grand canonical result for (AE) 2 , 
namely 


(AE) 2 = kT 2 C v + (AN) 2 {(dE/dN) T } 2 , 


( 21 ) 


as in equation (4.5.14). 

In passing, we note that the first part of expression (21) denotes the mean square fluc¬ 
tuation in the energy £ of a subsystem for which both N and V are fixed, just as we have 
in the canonical ensemble (N, V, T). Conversely, if we assume the energy E to be fixed, 
then the temperature of the subsystem will fluctuate, and the mean square value of the 
quantity AT will be given by (kT 2 Cy) divided by the square of the thermal capacity of the 
subsystem. The net result will, therefore, be (kT 2 /Cy), which is the same as in (14a). 

15.2 The Einstein-Smoluchowski theory 
of the Brownian motion 

The term “Brownian motion” derives its name from the botanist Robert Brown who, in 
1828, made careful observations on the tiny pollen grains of a plant under a microscope. In 
his own words: “While examining the form of the particles immersed in water, I observed 
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many of them very evidently in motion. These motions were such as to satisfy me ... that 
they arose neither from currents in the fluid nor from its gradual evaporation, but belonged 
to the particle itself.” We now know that the real source of this motion lies in the incessant, 
and more or less random, bombardment of the Brownian particles, as these grains (or, 
for that matter, any colloidal suspensions) are usually referred to, by the molecules of the 
surrounding fluid. It was Einstein who, in a number of papers (beginning in 1905), first 
provided a sound theoretical analysis of the Brownian motion on the basis of the so-called 
‘‘random walk problem” and thereby established a far-reaching relationship between the 
irreversible nature of this phenomenon and the mechanism of molecular fluctuations. 

To illustrate the essential theme of Einstein’s approach, we first consider the problem in 
one dimension. Let x(t) denote the position of the Brownian particle at time t, given that 
its position coincided with the point x = 0 at time t = 0. To simplify matters, we assume 
that each molecular impact (which, on an average, takes place after a time r*) causes the 
particle to jump a (small) distance l — of constant magnitude — in either a positive or 
negative direction along the x-axis. It seems natural to regard the possibilities Ax = +1 
and Ax = —l to be equally likely; though somewhat less natural, we may also regard the 
successive impacts on, and hence the successive jumps of, the Brownian particle to be 
mutually uncorrelated. The probability that the particle is found at the point x at time t is 
then equal to the probability that, in a series of n (= t/r*) successive jumps, the particle 
makes m(— x/l) more jumps in the positive direction of the x-axis than in the negative, 
that is, it makes \ (n + m ) jumps in the positive direction and I, in — m) in the negative. 1 
The desired probability is then given by the binomial expression 


Pnim) = 


n\ 


Utra + wbJlUcw-w)!! 



( 1 ) 


with the result that 


m = 0 and m 2 = n. (2) 

Thus, for t » r*, we have for the net displacement of the particle 

x{t) = 0 and x 2 (t) = l 2 — oc t 1 . (3) 

T* 

Accordingly, the root-mean-square displacement of the particle is proportional to the 
square root of the time elapsed: 

Xr.m.s. — %/ (x 2 (t)) = lyj (t/r*) oc f 1 ^. (4) 

It should be noted that the proportionality of the net overall displacement of the Brownian 
particle to the square root of the total number of elementary steps is a typical consequence 

1 Since the quantities x and t are macroscopic in nature while l and r* are microscopic, the numbers n and m are 
much larger than unity; consequently, it is safe to assume that they are integral as well. 
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of the random nature of the steps and it manifests itself in a large variety of phenomena in 
nature. In contrast, if the successive steps were fully coherent (or else if the motion were 
completely predictable and reversible over the time interval t), 2 then the net displacement 
of the Brownian particle would have been proportional to t 1 . 

Smoluchowski’s approach to the problem of Brownian motion, which appeared in 
1906, was essentially the same as that of Einstein; the difference lay primarily in the math¬ 
ematical procedure. Smoluchowski introduced the probability function p n (x o|x), which 
denotes the “probability that, after a series of n steps, the Brownian particle, initially at 
the point xo, reaches the point x”; the number x here denotes the distance traveled by the 
Brownian particle in terms of the length of the elementary step. Clearly, 

OO 

Pn(x 0 |x)= J2 Pn-i(xo\z)pi(z\x) (n> 1); (5) 

Z ——OO 


moreover, since a single step is equally likely to take the particle to the right or to the left, 


pi(z\x) = 


1 

2 Sz ’ x ~ 


1 

1 + 


( 6 ) 


while 


po(z\x) = S z , x . (7) 

Equation (5) is known as the Smoluchowski equation. To solve it, we introduce a generating 
function Q n (f), namely 


OO 

Qn(?)= J2 Pn(Xom X ~ X °, 
x=—oo 


( 8 ) 


from which it follows that 


OO OO 

Qo(?) = J2 Po(xo\x)^ x ~ x ° = J2 s xo,x$ x ~ x ° = k 

x =— OO x =— OO 


(9) 


Substituting (6) into (5), we obtain 


Pn(X o|X) = i(X 0 |X— 1)+ ^p„_i(X 0 |X+ 1). (10) 

2 The term “reversible" here is related to the fact that the Newtonian equations of motion, which govern this class 
of phenomena, preserve their form if the direction of time is reversed (i.e., if we change tto-t, etc.); alternatively, one 
would expect that if at any instant of time we reverse the velocities of the particles in a given mechanical system, the 
system would “retrace” its path exactly. This is not true of equations describing “irreversible” phenomena, such as the 
diffusion equation (19), with which the phenomenon of Brownian motion is intimately related. 
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Multiplying (10) by £ x x ° and adding over all x, we obtain the recurrence relation 


Qn($)= + (l/$)]Qn-l($). 


(11) 


so that, by iteration, 



( 12 ) 


Expanding this expression binomially and comparing the result with (8), we get 



for |x — Xq| > n. 


(13) 


0 


Identifying (x — xo) with m, we find this result to be in complete agreement with our previ¬ 
ous result (l). 3 Accordingly, any conclusions drawn from the Smoluchowski approach will 
be the same as the ones drawn from the Einstein approach. 

To obtain an asymptotic form of the function p n (m), we apply Stirling’s formula, 
n!s» (27rn) 1 / 2 (n/e)“, to the factorials appearing in (1), with the result 




For m<^n (which is generally true because m — 0 and m r . m . s . = n 1 / 2 , while n » 1), we 
obtain 



exp(— m 2 /2ri). 


(14) 


Taking x to be a continuous variable (and remembering that p n (m) = 0 either for even val¬ 
ues of m or for odd values of m, so that in the distribution (14), Am = 2 and not 1), we may 
write this result in the Gaussian form: 



(15) 


where 


D = Z 2 /2t*. 


(16) 


3 It is easy to recognize the additional fact that if n is even, then p n (m) = 0 for odd m, and if n is odd, then p n (m) = 0 
for even m. 
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Later on, we shall see that the quantity D introduced here is identical to the diffusion 
coefficient of the given system; equation (16) connects this quantity with the microscopic 
quantities l and t*. To appreciate this connection, one has simply to note that the prob¬ 
lem of Brownian motion can also be looked on as a problem of “diffusion” of Brownian 
particles through the medium of the fluid; this point of view is also due to Einstein. How¬ 
ever, before we embark on these considerations, we would like to present here the results 
of an actual observation made on the Brownian motion of a spherical particle immersed 
in water; see Lee, Sears, and Turcotte (1963). It was found that the 403 values of the net dis¬ 
placement Ax of the particle, observed after successive intervals of 2 seconds each, were 
distributed as follows: 


Displacement Ax, in units of ji{— 10 4 cm) 

Frequency of occurrence n 

less than —5.5 

0 

between —5.5 and —4.5 

1 

between —4.5 and —3.5 

2 

between —3.5 and —2.5 

15 

between —2.5 and —1.5 

32 

between —1.5 and —0.5 

95 

between —0.5 and +0.5 

111 

between +0.5 and +1.5 

87 

between +1.5 and +2.5 

47 

between +2.5 and +3.5 

8 

between +3.5 and +4.5 

5 

greater than +4.5 

0 


The mean square value of the displacement here turns out to be: (Ax) 2 = 2.09 x 
10 _8 cm 2 . The observed frequency distribution has been plotted as a “block diagram” in 
Figure 15.1. We have included, in this figure, a Gaussian curve based on the observed value 
of the mean square displacement; we find that the experimental data fit the theoretical 
curve fairly well. We can also derive here an experimental value for the diffusion coefficient 
of the medium; we obtain: D = (Ax) 2 /21 — 5.22 x 10 _9 cm 2 /s. 4 

We now turn to the study of the Brownian motion from the point of view of diffusion. 
We denote the number density of the Brownian particles in the fluid by the symbol n(r, t ) 
and their current density by j(r, i)\— n{r, t)v(r ,/)}; then, according to Fick’s law, 

j(r, t) = — DVn(r, t), (17) 


4 In the next section we shall see that, for a spherical particle, D= kT/Gnria where 17 is the coefficient of viscosity 
of the medium and a the radius of the Brownian particle. In the case under study, T ~ 300 K, t] ~ 10 -2 poise, and a ~ 
4 x 10 -5 cm. Substituting these values, we obtain for the Boltzmann constant: t~1.3x 10~ 16 erg/K. 
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n(AX) 



Ax 


FIGURE 15.1 The statistical distribution of the successive displacements, Ax, of a Brownian particle immersed in 
water: (Ax) r . m . s . ~ 1.45/x. 


where D denotes for the diffusion coefficient of the medium. We also have here the 
equation of continuity, namely 


3 n{r,t) 

v • j(T, t) H-Try—- = °- 

01 

Substituting (17) into (18), we obtain the diffusion equation 

o 1 dn(r,t ) 

V 2 n(r, t) - —— = 0 . 

D 3 1 


(18) 


(19) 


Of the various possible solutions of this equation, the one relevant to the present situa¬ 
tion is 


N 


r 2 \ 


which is a spherically symmetric solution and is already normalized: 


( 20 ) 


OC 

/ 


n{r, t)Anr 2 dr = N, 


( 21 ) 


N being the total number of (Brownian) particles immersed in the fluid. A comparison 
of the (three-dimensional) result (20) with the (one-dimensional) result (15) brings out 
most vividly the relationship between the random walk problem on one hand and the 
phenomenon of diffusion on the other. 

It is clear that in the last approach we have considered the motion of an “ensemble” of 
N Brownian particles placed under “equivalent” physical conditions, rather than consid¬ 
ering the motion of a single particle over a length of time (as was done in the random walk 
approach). Accordingly, the averages of the various physical quantities obtained here will 
be in the nature of “ensemble averages”; they must, of course, agree with the long-time 
averages of the same quantities obtained earlier. 
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Now, by virtue of the distribution (20), we obtain 



( 22 ) 


0 

in complete agreement with our earlier results, namely 


x(t) = 0; x 2 {t) = l 2 t/z* = 2Dtoct 1 . 


(23) 


Thus, the “ensemble” of the Brownian particles, initially concentrated at the origin, “dif¬ 
fuses out” as time increases, the nature and the extent of its spread at any time t being given 
by equations (20) and (22), respectively. The diffusion process, which is clearly irrever¬ 
sible, gives us a fairly good picture of the statistical behavior of a single particle in the 
ensemble. However, the important thing to bear in mind is that, whether we focus our 
attention on a single particle in the ensemble or look at the ensemble as a whole, the ulti¬ 
mate source of the phenomenon lies in the incessant, and more or less random, impacts 
received by the Brownian particles from the molecules of the fluid. In other words, the 
irreversible character of the phenomenon ultimately arises from the random, fluctuating 
forces exerted by the fluid molecules on the Brownian particles. This leads us to another 
systematic theory of the Brownian motion, namely the theory of Langevin (1908). For 
a detailed analysis of the problem, see Uhlenbeck and Ornstein (1930), Chandrasekhar 
(1943, 1949), MacDonald (1948-1949), and Wax (1954). 


15.3 The Langevin theory of the 
Brownian motion 


We consider the simplest case of a “free” Brownian particle, surrounded by a fluid envi¬ 
ronment; the particle is assumed to be free in the sense that it is not acted on by any other 
force except the one arising from the molecular bombardment. The equation of motion of 
the particle will then be 



( 1 ) 


where M is the particle mass, v(t) the particle velocity, and F(t) the force acting on the 
particle by virtue of the impacts received from the fluid molecules. Langevin suggested 
that the force Fit) may be written as a sum of two parts: (i) an “averaged-out” part, which 
represents the viscous drag, —v/B, experienced by the particle (accordingly, B is the mobil¬ 
ity of the system, that is, the drift velocity acquired by the particle by virtue of a unit 
“external” force) 5 and (ii) a “rapidly fluctuating” part F(t) which, over long intervals of 

5 If Stokes’s law is applicable, thenB = 1/(6; rrja), where r\ is the coefficient of viscosity of the fluid and a the radius of 
the particle (assumed spherical). 
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time (as compared to the characteristic time r*), averages out to zero; thus, we may write 

( 2 ) 


dv v - 

M— = — -+F(f); F(t) = 0. 
dt B 


Taking the ensemble average of (2), we obtain 6 




(3) 


which gives 


(v(t)) = i/(0)exp(-«/r) (r = MB). (4) 

Thus, the mean drift velocity of the particle decays, at a rate determined by the relaxation 
time r, to the ultimate value zero. We note that this result is typical of the phenomena 
governed by dissipative properties such as the viscosity of the fluid; the irreversible nature 
of the result is also evident. 

Dividing (2) by the mass of the particle, we obtain an equation for the instantaneous 
acceleration, namely 


dv v - 

— = — +A(f); A(t) = 0. (5) 

at r 

We now construct the scalar product of (5) with the instantaneous position r of the particle 
and take the ensemble average of the product. In doing so, we make use of the facts that 
(i) r v = j(dr 2 /dt), (ii) r ( dv/dt) = \(d 2 r 2 /dt 2 ) — v 2 , and (iii) {r -A) = 0. 7 We obtain 


d n id') 9 


( 6 ) 


If the Brownian particle has already attained thermal equilibrium with the molecules of 
the fluid, then the quantity {v 2 ) in this equation may be replaced by its equipartition value 
3kT/M. The equation is then readily integrated, with the result 


<' ) = 


6fcTr 2 

M 


- (1 - e~ t/x 


) . 


(7) 


6 The process of “averaging over an ensemble” implies that we are imagining a large number of systems similar to the 
one originally under consideration and are taking an average over this collection at any time f. By the very nature of the 
function F(t), the ensemble average (F(f)> must be zero at all times. 

7 This is so because we have no reason to expect a statistical correlation between the position r(t) of the Brownian 
particle and the force F(t) exerted on it by the molecules of the fluid; see, however, Manoliu and Kittel (1979). Of course, 
we do expect a correlation between the variables v(t) and F(t)\ consequently, (v ■ F) 0 (see Problem 15.7). 
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where the constants of integration have been so chosen that at, t = 0, both (r 2 ) and its first 
time-derivative vanish. We observe that, for t r, 


9 3kT n 9 , 
(r 2 ).—t 2 = (^)t 2 , 


( 8) 8 


which is consistent with the reversible equations of motion whereby one would simply 
have 


r = vt. 


0 ) 


On the other hand, for t » r, 


(T Z )-^-t=mkT)t, ( 10) 9 

which is essentially the same as the Einstein-Smoluchowski result (15.2.22); incidentally, 
we obtain here a simple, but important, relationship between the coefficient of diffusion 
D and the mobility B, namely 


D = BlcT, 


( 11 ) 


which is generally referred to as the Einstein relation. 

The irreversible character of equation (10) is self-evident; it is also clear that it arises 
essentially from the viscosity of the medium. Moreover, the Einstein relation (11), which 
connects the coefficient of diffusion D with the mobility B of the system, tells us that the 
ultimate source of the viscosity of the medium (as well as of diffusion) lies in the random, 
fluctuating forces arising from the incessant motion of the fluid molecules; see also the 
fluctuation-dissipation theorem of Section 15.6. 

In this context, if we consider a particle of charge e and mass M moving in a viscous 
fluid under the influence of an external electric field of intensity E, then the “coarse¬ 
grained” motion of the particle will be determined by the equation 


M 4~Jv) = -hv) + eE; 
at B 


( 12 ) 


compare this to equation (3). The “terminal” drift velocity of the particle would now be 
given by the expression ( eB)E , which prompts one to define (eB) as the “mobility” of the 
system and denote it by the symbol p. Consequently, one obtains, instead of (11), 



e 




(13) 


which, in fact, is the original version of the Einstein relation; sometimes this is also referred 
to as the Nernst relation. 


s Note that the limiting solution (8) corresponds to “dropping out” the second term on the left side of equation (6). 
9 Note that the limiting solution (10) corresponds to “dropping out" the first term on the left side of equation (6). 
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So far we have not felt any direct influence of the rapidly fluctuating term A(t) that 
appears in the equation of motion (5) of the Brownian particle. For this, let us try to eval¬ 
uate the quantity ( v 2 (t )) which, in the preceding analysis, was assumed to have already 
attained its “limiting” value 3 kT/M. For this evaluation we replace the variable t in equa¬ 
tion (5) by u, multiply both sides of the equation by exp(w/r ), rearrange and integrate over 
du between the limits u = 0 and u — t; we thus obtain the formal solution 

t 

0 

Thus, the drift velocity v(t) of the particle is also a fluctuating function of time; of course, 
since { A(u )) = 0 for all u, the average drift velocity is given by the first term alone, namely 

{v(t)) = v(0)e~d T , (15) 


which is the same as our earlier result (4). For the mean square velocity (v 2 (t)), we now 
obtain from (14) 


( v 2 (t )) = i^tOie -2 ^ + 2e~ 2t/z 


L 

1/(0) - J e “/ r {A(u))du 


t t 

+ e~ 2t/T J J e <U]+U2>/t (A(U]) ■A(U 2 ))du i du 2 . 
o o 


(16) 


The second term on the right side of this equation is identically zero, because (A(u)) van¬ 
ishes for all u. In the third term, we have the quantity (A(u \) -A(U 2 )), which is a measure of 
the “statistical correlation between the value of the fluctuating variable A at time u\ and its 
value at time Hi"', we call it the autocorrelation function of the variable^ and denote it by 
the symbol Ka{u\,U 2 ) or simply by K(U] , 112 ). Before proceeding with (16) any further, we 
place on record some of the important properties of the function K{U\,U 2 ). 


(i) In a stationary ensemble (i.e., one in which the overall macroscopic behavior of the 
systems does not change with time), the function K(u\, U 2 ) depends only on the time 
interval (U 2 — U\). Denoting this interval by the symbol s, we have 


K{u\,ui + 5 ) = ( A(u\) -A(u\ + s)) = K(s), independently of U\ . (17) 

(ii) The quantity K(0), which is identically equal to the mean square value of the variable 
A at time ii \, must be positive definite. In a stationary ensemble, it would be a 
constant, independent of up. 


K( 0) = const. > 0. 


( 18 ) 


(iii) For any value of s, the magnitude of the function K(s) cannot exceed K( 0). 
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Proof: Since 

(\A(ui) ± A(u 2 )\ 2 ) = {A 2 (u i )> + (A 2 (ii 2 ))±2(A(u 1 )-A(u 2 )) 

= 2{K(0)±K(s)}>0, 

the function K(s) cannot go outside the limits —K( 0) and +K(0); consequently, 

|7T(s)| < K(0) for alls. (19) 

(iv) The function K(s) is symmetric about the value s = 0, that is, 

K(-s) = K(s) = KQs\). (20) 


Proof: 


K(s ) = (A(u\) A(ui +s)) = (A(U] —s) -A(u\)) 10 
= {A(u\)-A(u\ —s )) =K(-s). 


(v) As 5 becomes large in comparison with the characteristic time r*, the values A(ui) 
an&A(u\ +s) become uncorrelated, that is 


K(s) = (A{ui) -A(ui+ s)) -> {A(u\)) ■ {A(ui + s)> = 0. (21) 

In other words, the “memory” of the molecular impacts received during a given interval 
of time, say between u\ and u\ + du\, is “completely lost” after a lapse of time large in 
comparison with r*. It follows that the magnitude of the function K(s) is significant only 
so long as the variable s is of the same order of magnitude as r*. 

Figures 15.7 through 15.9 later in this chapter show the 5-dependence of certain typical 
correlation functions K(s); they fully conform to the properties listed here. 

We now evaluate the double integral appearing in (16): 

t t 

7 = J J e (U]+U2 ^ T K(u 2 — U\)du\du 2 . ( 22 ) 

0 0 


Changing over to the variables 

S = — (Mi + u 2 ) and s=(u 2 — u\), (23) 

the integrand becomes exp(2S/r)K(s), the element (dii] du 2 ) gets replaced by the corre¬ 
sponding element ( dSds ) while the limits of integration, in terms of the variables S and s, 


10 This is the only crucial step in the proof. It involves a ‘‘shift,’’ by an amount s, in both instants of the measurement 
process; the equality results from the fact that the ensemble is supposed to be stationary. 
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FIGURE 15.2 Limits of integration, of the double integral I, in terms of the variables S and s. 


can be read from Figure 15.2; we find that, for 0 < S < f/2, s goes from -2S to +2S while, for 
t/2 < S < t, it goes from -2 (t - S ) to +2(t - S). Accordingly, 


f/2 +2S f +2«-S) 

I = J e 2S hdS J K(s)ds+ J e 2S ^ z dS J K(s)ds. 

0 —2S f/2 —2(t—S) 


(24) 


In view of property (v) of the function K(s), see equation (21), the integrals over .s' draw 
significant contribution only from a very narrow region, of the order oft*, around the value 
5 = 0 (i.e., from the shaded region in Figure 15.2); contributions from regions with larger 
values of |s| are negligible. Thus, if t » r*, the limits of integration for 5 may be replaced by 
—oo and +oo, with the result 


where 


f 

I-cJe 2S/z dS = C^(e 2t/z - 1), 
0 


C = 


oo 

J K(s)ds. 

—OO 


(25) 


(26) 


Substituting (25) into (16), we obtain 


(y 2 (r)) = v 2 (G)e~ 2tlz + c|(l - e~ 2t/z ). 


(27) 


Now, as t -*■ oo, (v 2 (l)) must tend to the equipartition value 3 kT/M; therefore, 


C = 6fcT/Mr 


(28) 
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and hence 


<c 2 (0) = c 2 (0) + j ^ - iAO) j (1 - e' 2f/T ). 


(29) 11 


We note that if v 2 (0) were itself equal to the equipartition value 3 kT/M, then (v 2 it)) would 
always remain the same, which shows that statistical equilibrium, once attained, has a 
natural tendency to persist. 

Substituting (29) into the right side of ( 6 ), we obtain a more representative description 
of the manner in which the quantity (r 2 ) varies with t; we thus have 

d <) 1 d 'y 9 9,6 kT 'yti-r 

-rr<r 2 ) + --r(r 2 ) = 2v 2 (0)e~ 2t ' T + — (1 - e“ 2f / T ), (30) 

x dt M 


with the solution 

(r 2 ) = p 2 ( 0 )r 2 (l - e _f/T ) 2 - ^t 2 (1 - e~ t/T )(3 - e~ t/z ) + t. (31) 

Solution (31) satisfies the initial conditions that both (r 2 ) and its first time-derivative van¬ 
ish at t — 0; moreover, if we put n 2 (0) = 3 kT /M, it reduces to solution (7) obtained earlier. 
Once again, we note the reversible nature of the motion for t <$; r, with (r 2 ) ~ v 2 (0)t 2 , and 
its irreversible nature for t » r, with (r 2 ) ~ ( 6 BkT)t. 

Figures 15.3 and 15.4 show the variation, with time, of the ensemble averages (v 2 (t)) 
and (r 2 (f)> of a Brownian particle, as given by equations (29) and (31), respectively. All 
important features of our results are manifestly evident in these plots. 

Brownian motion continues to be a topic of contemporary research nearly 200 years 
after Brown’s discovery and over 100 years after Einstein and Smoluchowski’s analysis and 
early measurements by Perrin. The renewed interest is due to the growth in the techno¬ 
logical importance of colloids across a wide range of fields and the development of digital 
video and computer image analysis. An interesting example is the detailed observation and 
analysis of rotational and two-dimensional translational Brownian motion of ellipsoidal 
particles by Han et al. (2006) in a thin microscope slide. The case of rotational Brownian 
motion was first analyzed by Einstein (1906b) and first measured by Perrin (1934, 1936). 
Both rotational and translational modes diffuse according to Langevin dynamics but the 
translational diffusion is coupled to the rotational diffusion since the translational diffu¬ 
sion constant parallel to the longer axis is larger than the diffusion constant perpendicular 

11 One may check that 

^<c 2 (0) = ^ [c 2 (oo)~ (c 2 (t))] = - 2 -\(v 2 (t)), 

where dToo) = 3kT /M and A(v 2 tf)) is the "deviation of the quantity concerned from its equilibrium value.’’ In this form 
of the equation, we have a typical example of a “relaxation phenomenon,” with relaxation time r/2. 
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FIGURE 15.3 The mean square velocity of a Brownian particle as a function of time. Curves 1, 2, and 3 correspond, 
respectively, to the initial conditions f 2 (0) = 6kT/M,3kT/M, and 0. 



FIGURE 15.4 The mean square displacement of a Brownian particle as a function of time. Curves 1, 2, and 3 
correspond, respectively, to the initial conditions t> 2 (0) = 6 kT/M, 3 kT/M, and 0. 


to that axis. The rotational diffusion and both long-axis (a) and short-axis (b) body-frame 
diffusions are all Gaussian: 


1 / (A6>) 2 \ 

wla9 '' , = VSBr x T wj’ 

p * (A *‘' ,, = 75ro ex Tw)’ 

1 / (Ax b ) 2 \ 

,,l(AXb '' ,= 75^ exp (-«v)' 


(32a) 

(32b) 

(32c) 
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with diffusion constants Dg, D a , and D[,. Experiments have observed the complex 
two-dimensional spatial diffusion at short times {t < xg = 1/(2 Dg)), as predicted by the 
Langevin theory. The long-time it » r g) spatial diffusion is isotropic with diffusion con¬ 
stant D = (D a + D[,)/2. 

15.3.A Brownian motion of a harmonic oscillator 

An analysis similar to the one for a diffusing Brownian particle can also be performed 
for a particle in a harmonic oscillator potential that prevents the particle from diffus¬ 
ing away from the origin and allows a more general analysis of the relationship between 
the position and velocity response functions and the power spectra of the fluctua¬ 
tions; see Kappler (1938) and Chandrasekhar (1943). The one-dimensional equation of 
motion for a Brownian particle of mass M in a harmonic oscillator potential with spring 
constant M&>q is 


d 2 x dx o 
dfi +Y d~t +<V ° X = 


m 

M ’ 


(33) 


where y (= Gitria/M) is the damping coefficient of a spherical particle in a fluid with 
viscosity /?. Just as in the case of diffusive Brownian motion, the force F(L) can be a time- 
dependent external force designed to explore the response function or a time-dependent 
random force due to collisions with molecules in the fluid to analyze the equilibrium fluc¬ 
tuations. Assuming the system was in equilibrium in the distant past, the position at time 
t is given by 

t 

x(t) = J x xx (t- t')F(t')dt', (34) 

— OO 

where 


y_xx($) 


1 . 

- e 2 sm(a)is) 

Midi 


(35) 


is the xx response function and u>i 



z± 

4 


12 The velocity response is given by 


t 

V(t) = J Xvx(t-t')F(t')dt', (36) 

— OO 

12 This form of the response function assumes that the oscillator is underdamped. The notation xxx refers to the 
notation used in Section 15.6.A in which the response of the position coordinate x depends on the applied field F that 
couples to the Hamiltonian via a term -F(t)x(t). 
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where 


XvxiS) — 



ys 

2 


^COS (wiS) — 


Y 

2 co i 


sin(cDis) ). 


(37) 


The response of the system can be decomposed into a sum of independent terms 
involving a sinusoidal applied force F(a>)e l(ot . This takes the form 


x(co) = Xxx(o})F(a>), (38) 

where the frequency-dependent response function can be decomposed into real and 
imaginary parts x' xx (w) and 


oo 

Xxx(<i>) — j Xxx{s)e koS ds = x' X xhr>) + *Xja:( <w )> 
i 

Xxx^ 0> ) = 


X-xx^ 0 ’) — 


M[(a> g — a > 2 ) 2 + y 2 a> 2 ] 
yen 

M[(ft)g — to 2 ) 2 + y 2 a> 2 ] 


(39a) 


(39b) 

(39c) 


The real part here describes the dispersion and the imaginary part describes the dissipa¬ 
tion, that is, it sets the average rate of energy dissipation due to the sinusoidal external 
force. 

Now let’s consider the natural fluctuations of the position and the velocity of the parti¬ 
cle in equilibrium due to the random collisions with the atoms in the fluid. We will use the 
same Langevin formalism as was used earlier with Brownian motion of a free particle. The 
random force averages to zero and is assumed to be delta-function correlated in time: 


(F) = 0, (40a) 

(J 7 (t)F(t , )) = r5(t-t'), (40b) 

where T = 2 yMkT. With this choice, the long-time average position and velocity of the 
particle are both zero, 


(x(t)) = (v{t)) = 0, (41) 

and the average of the squares of the position and velocity both obey the equipartition 
theorem: 


(x 2 (t)) = 


kT 

Mco 2 ’ 


o kT 
(^AW 


(42a, b) 
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The xx correlation function is given by 


kT 






2 


(43) 


and the xx power spectrum by 


OO 



(44) 


Note that the imaginary part of the response function, x xx (oj), in equation (39c) is propor¬ 
tional to the power spectrum S xx (a>): 



(45) 


This result indicates that the dissipation that results from driving a system out of equilib¬ 
rium by an external force is proportional to the power spectrum of the natural fluctuations 
that occur in equilibrium. While this result was derived here for a very specific model, it 
constitutes an example of the very general fluctuation-dissipation theorem we will derive 
in Section 15.6.A. 


15.4 Approach to equilibrium: the 
Fokker-Planck equation 


In our analysis of the Brownian motion we have considered the behavior of a dynamical 
variable, such as the position r(t) or the velocity v(t) of a Brownian particle, from the point 
of view of fluctuations in the value of the variable. To determine the average behavior of 
such a variable, we sometimes invoked an “ensemble” of Brownian particles immersed 
in identical environments and undergoing diffusion. A treatment along these lines was 
carried out toward the end of Section 15.2, and the most important results of that treat¬ 
ment are summarized in equation (15.2.20) for the density function n(r, t) and in equation 
(15.2.22) for the mean square displacement ( r 2 (t )). 

A more generalized way of looking at “the manner in which, and the rate at which, 
a given distribution of Brownian particles approaches a state of thermal equilibrium” is 
provided by the so-called Master Equation, a simplified version of which is known as the 
Fokker-Planck equation. For illustration, we examine the displacement, x(f), of the given 
set of particles along the x-axis. At any time t, let f(x, t)dx be the probability that an arbi¬ 
trary particle in the ensemble may have a displacement between x and x + dx. The function 
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f(x, t) must satisfy the normalization condition 

OO 

J f(x,t)dx=\. (1) 

—OO 


The Master Equation then reads: 

OO 

df(x,t) _ f ^_^ x ^^ ) Y/( X , x !)+f( X !,i)W{ X !,x)}d X !, (2) 

3 1 J 

— OO 

where W(x,xf)dx'St denotes the probability that, in a short interval of time St, a parti¬ 
cle having displacement x makes a “transition” to having a displacement between x' and 

x! + dx'. u 

The first part of the integral in equation (2) corresponds to all those transitions that 
remove particles from the displacement x at time t to some other displacement xf and, 
hence, represent a net loss to the function/(x, t); similarly, the second part of the integral 
corresponds to all those transitions that bring particles from some other displacement xf 
at time t to the displacement x and, hence, represent a net gain to the function/(x, t). u 

The structure of the Master Equation is thus founded on very simple and straightfor¬ 
ward premises. Of course, under certain conditions, this equation, or any generalization 
thereof (such as the one including velocity, or momentum, coordinates in the argument of 
/), can be reduced to the simple form 


3/ f~fo 

dt r 


(3) 


which has proved to be a very useful first approximation for studying problems related 
to transport phenomena. Elere, f 0 denotes the equilibrium distribution function (for 
3 //dt — 0 when f —fo), while r is the relaxation time that determines the rate at which 
the fluctuations in the system drive it to a state of equilibrium. 

In studying Brownian motion on the basis of equation (2), we can safely assume that 
it is only transitions between “closely neighboring” states x and x! that have an apprecia¬ 
ble probability of occurring; in other words, the transition probability function W(x,x') is 
sharply peaked around the value xf = x and falls rapidly to zero away from x. Denoting the 
interval (xf — x) by f, we may write 


W(x,xf) -> W(x;£), W(xf,x) W(x';-£) (4) 

13 We are tacitly assuming here a "Markovian” situation where the transition probability function W(x,x') depends 
only on the present position x (and, of course, the subsequent position x') of the particle but not on the previous history 
of the particle. 

14 In the case of fermions, an account must be taken of the Pauli exclusion principle, which controls the “occupation 
of single-particle states in the system”; for instance, we cannot, in that case, consider a transition that tends to transfer a 
particle to a state that is already occupied. This requires an appropriate modification of the Master Equation. 
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where W{x ;$) and W(x';—$) have sharp peaks around the value £ — 0 and fall rapidly to 
zero elsewhere. 15 This enables us to expand the right side of (2) as a Taylor series around 
£ = 0. Retaining terms up to second order only, we obtain 

9 f(x,t) d „ Id 2 

a . = - t-{mi t)\ + -—-{ t i 2 {x)f{x, t)}, ( 5 ) 

at ox z ox* 

where 


OO 

r , (8x)st 

Mi(x) = I %W(x-,%)d% = —— - (v x ) 

— OO 


( 6 ) 


and 


OO 

M 2(x)= J $ 2 W(X]t;)d% = 


mr)»t 

st 


(7) 


Equation (5) is the so-called Fokker-Planck equation, which occupies a classic place in the 
field of Brownian motion and fluctuations. 

We now consider a specific system of Brownian particles (of negligible mass), each par¬ 
ticle being acted on by a linear restoring force, F x = —ax, and having mobility B in the 
surrounding medium; the assumption of negligible mass implies that the relaxation time 
r(= MB) of equation (15.3.4) is very small, so the time t here may be regarded as very large 
in comparison with that r. The mean viscous force, — (v x ) /B, is then balanced by the linear 
restoring force, with the result that 


{Vx) 

B 


+ F x — 0 


( 8 ) 


and hence 


{v x ) = mix) = -XBx. 


(9) 


Next, in view of equation (15.3.10), we have 

{ -^p-= IX2 (x) = 2BkT ] ( 10 ) 

it will be noted that the influence of X on this quantity is being neglected here. Substituting 
(9) and (10) into (5), we obtain 


— = XB—(xf) + BkT —t . (11) 

dt dx J dx 2 

15 Clearly, this assumption limits our analysis to what may be called the “Brownian motion approximation," in which 
the object under consideration is presumed to be on a very different scale of magnitude than the molecules constituting 
the environment. It is obvious that if one tries to apply this sort of analysis to “understand” the behavior of molecules 
themselves, one cannot hope for anything but a “crude, semiquantitative” outcome. 
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Now we apply equation (11) to an “ensemble” of Brownian particles, initially concen¬ 
trated at the point x = xo- To begin with, we note that, in the absence of the restoring force 
(A. = 0), equation (11) reduces to the one-dimensional diffusion equation 

d f=D d -X (D = BkT), (12) 

dt dx 2 

which conforms to our earlier results (15.2.19) and (15.3.11). The present derivation shows 
that the process of diffusion is essentially a “random walk, at the molecular level.” In view 
of equation (15.2.20), the function/(x, t) here would be 




(X-Xp) 2 ) 

4 Dt j 


(13) 


with 


x = xo and x 2 = Xg + 2 Dt; 


(14) 


the last result shows that the mean square distance traversed by the particle (s) increases 
linearly with time, without any upper limit on its value. The restoring force, however, puts 
a check on the diffusive tendency of the particles. For instance, in the presence of such 
a force (1 ^ 0), the terminal distribution/oo (for which df/dt = 0) is determined by the 
equation 


d 

dX 


(x/oo) + 


kT d 2 foo 
x dx 2 


= o, 


(15) 


which gives 


, / x y / 2 

&w= Uatf) exp 



(16) 


with 


x = 0 and x 2 = kT/k. (17) 

The last result agrees with the fact that the mean square value of x must ultimately comply 
with the equipartition theorem, namely (^kx 2 )^ = \kT. From the point of view of equilib¬ 
rium statistical mechanics, if we regard Brownian particles with kinetic energy p 2 /2m and 
potential energy \kx 2 as loosely coupled to a thermal environment at temperature T, then 
we may directly write 

feq(x, p x )dxdp x oc e -<Px/ 2 m +^ : * 2 / 2 )l /cT dxdp x . (18) 

On integration over p x , expression (18) leads directly to the distribution function (16). 
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FIGURE 15.5 The distribution function (19) at times t = 0, t = 1/(2 XB), and t. = oo. 


The general solution of equation (11), relevant to the ensemble under consideration, is 
given by 


/(x, t) = 


A )h2 _ 

2nkT(l - e - 2} lil ) J 6XP 


X(x — Xoe XBt ) 2 | 
2kT(l - e ~ 2XBt ) j ’ 


with 


x = x 0 e kBt and x 2 = xle 2lBt +—(\~ 


„-2XBK 


(19) 


( 20 ) 


in the limit X -»• 0, we recover the purely “diffusive” situation, as described by equa¬ 
tions (13) and (14), while for t » {XB)~ l , we approach the “terminal” situation, as 
described by equations (16) and (17). Figure 15.5 shows the manner in which an ensem¬ 
ble of Brownian particles approaches a state of equilibrium under the combined influence 
of the restoring force and the molecular bombardment; clearly, the relaxation time of the 
present process is ~ ( XB ) _1 . 

A physical system to which the foregoing theory is readily applicable is provided by the 
oscillating component of a moving-coil galvanometer. Here, we have a coil of wire and a 
mirror that are suspended by a fine fiber, so they can rotate about a vertical axis. Random, 
incessant collisions of air molecules with the suspended system produce a succession of 
torques of fluctuating intensity; as a result, the angular position 6 of the system continu¬ 
ally fluctuates and the system exhibits an unsteady zero. This is clearly another example of 
the Brownian motion! The role of the viscous force in this case is played by the mechanism 
of air damping (or, else, electromagnetic damping) of the galvanometer, while the restor¬ 
ing torque, No — —c6, arises from the torsional properties of the fiber. In equilibrium, we 
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expect that 



1 9 \ 1, ^ kT 

-c6 2 ) = -kT, that is, 9 2 = — 
2/2 c 


(21) 


compare this to equation (17). An experimental determination of the mean square deflec¬ 
tion, 0 2 , of such a system was made by Kappler (1931) who, in turn, applied his results to 
derive, with the help of equation (21), an empirical value for the Boltzmann constant k (or, 
for that matter, the Avogadro number Na ). The system used by Kappler had a moment of 
inertia / = 4.552 x 10 _4 gcm 2 and a time period of oscillation r = 1379s; accordingly, the 
constant c of the restoring torque had a value given by the formula r = 2jt(I/c) ]/2 , so that 


c = 47r 2 (//r 2 ) = 9.443 x 10 9 gcm 2 s 2 /rad. 


The observed value of 9 2 , at a temperature of 287.1 K, was 4.178 x 10 -6 . Substituting these 
numbers in (21), Kappler obtained: k — 1.374 x 10 -16 erg K _1 . And, since the gas constant 
R is equal to 8.31 x 10 7 erg K _1 mole _1 , he obtained for the Avogadro number: Na — R/k = 
6.06 x 10 23 mole -1 . 

One might expect that by suspending the mirror system in an “evacuated” casing the 
fluctuations caused by the collisions of the air molecules could be severely reduced. This 
is not true because even at the lowest possible pressures there still remain a tremendously 
large number of molecules in the system that keep the Brownian motion “alive.” The inter¬ 
esting part of the story, however, is that the mean square deflection of the system, caused 
by molecular bombardment, is not at all affected by the density of the molecules; for a sys¬ 
tem in equilibrium, it is determined solely by the temperature. This situation is depicted, 
rather dramatically, in Figure 15.6 where we have two traces of oscillations of the mir¬ 
ror system, the upper one having been taken at the atmospheric pressure and the lower 
one at a pressure of 10 -4 mm of mercury. The root-mean-square deviation is very nearly 
the same in the two cases! Nevertheless, one does note a difference of “quality” between 
the two traces that relates to the “frequency spectrum” of the fluctuations and arises 
for the following reason. When the density of the surrounding gas is relatively high, the 
molecular impulses come in rapid succession, with the result that the individual deflec¬ 
tions of the system are large in number but small in magnitude. As the pressure is lowered, 
the time intervals between successive impulses become longer, making the individual 
deflections smaller in number but larger in magnitude. Flowever, the overall deflection, 
observed over a long interval of time, remains essentially the same. 




FIGURE 15.6 Two traces of the thermal oscillations of a mirror system suspended in air; the upper trace was taken 
at the atmospheric pressure, the lower one at a pressure of 1CT 4 mm of mercury. 
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15.5 Spectral analysis of fluctuations: the 
Wiener-Khintchine theorem 


We have already made reference to the (spectral) quality of a fluctuation pattern. Refer¬ 
ring once again to the patterns shown in Figure 15.6, we note that, even though the mean 
square fluctuation of the variable 0 is the same in the two cases, the second pattern is 
far more “jagged” than the first; in other words, the high-frequency components are far 
more prominent in the second pattern. At the same time, there is a lot more “predictabil¬ 
ity” in the first pattern (insofar as it is represented by a much smoother curve); in other 
words, the correlation function, or the memory function, K(s ) of the first pattern extends 
over much larger values of s. In fact, these two aspects of a fluctuation process, namely its 
time-dependence and its frequency spectrum, are very closely related to one another. And 
the most natural course for studying this relationship is to carry out a Fourier analysis of 
the given process. 

For this study we consider only those variables, y(t), whose mean square value, ( y 2 (t)), 
has already attained an equilibrium, or stationary, value: 


(y 2 (t)) = const. 


( 1 ) 


Such a variable is said to be statistically stationary. As an example of such a variable, we 
may recall the velocity v(t) of a “free” Brownian particle at times t much larger than the 
relaxation time r, see equation (15.3.29), or the displacement x(t) of a Brownian particle 
moving under the influence of a restoring force (F x — —ax) at times t much larger than 
see equation (15.4.20). Now, if the variable y(t) were strictly periodic (and hence 
completely predictable), with a time period T — l//o, then we could write 


OO OO 


y(t) = a 0 + n cos(2n nfot) + ^ b n sm(2nnfot) 


( 2 ) 


n= 1 72=1 


where 


T 



(3) 


0 

T 


a n = y J y(t)cos(2nnf 0 t)dt, 
o 


(4) 


and 


T 



(5) 


0 


in this case, the coefficients a and b would be completely known and would define, with 
no uncertainty, the frequency spectrum of the variable y(t ). If, on the other hand, the given 
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variable is more or less a random function of time, then the coefficients a and h would 
themselves be statistical in nature. To apply the concept of periodicity to such a function, 
we must take the “time interval of repetition” to be infinitely large, that is, we let/o -*■ 0. 

In the proposed limit, equation (3) would read 

T 

ao = Lim - f y(t)dt= (y(t)}; (6) 

T^oo 1 J 
0 

thus, the coefficient ao, which represents the mean (or d.c.) value of the variable y, may be 
determined either by taking a time average (over a sufficiently long interval) of the variable 
or by taking an ensemble average (at any instant of time t ). For convenience, and without 
loss of generality, we take a 0 = 0; in other words, we assume that from the actual values of 
the variable y(t) its mean value, (y(f)), has already been subtracted. 16 

Taking the ensemble average of equations (4) and (5), we obtain, for all n, 


(Cln) — {bn) — 0. 


(7) 


However, by taking the ensemble average of equation (2) squared, we obtain 

n n 

= J2 \ {} = const ® 

n 

The term \{{a\) + (b 2 )} represents the respective “share,” belonging to the frequency nfo, 
in the total, time-independent value of the quantity (y 2 (t)). Now, in view of the random¬ 
ness of the phases of the various components, we have, for all n, (a 2 ) = (h 2 ); consequently, 
equation (8) may be written as 


<y 


CXJ 

= !]<««> - [ 

n J 


w{f)df, 


(9) 


where 

(a 2 ) = w(nfo)A(nf 0 ), that is, w(nf 0 ) = T {a 2 n )\ (10) 

Jo 

the function w(f) defines the power spectrum of the variable y(f). 

We shall now show that the power spectrum w{f) of the fluctuating variable y(t) is 
completely determined by its autocorrelation function K(s). For this, we make use of 

16 Obviously, this does not affect the spectral quality of the fluctuations, except that now we do not have a compo¬ 
nent with frequency zero. To represent the actual situation, one may have to add, to the resulting spectrum, a suitably 
weighted S (/) -term. 
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equation (4), which gives 


l//o l//o 


(«n> 


= */c 


•// 

0 0 


(y(fi )y(f 2 )> cos(2n nfot\) cos(2n nfot 2 )dt\dt 2 . 


( 11 ) 


Changing over to the variables 

S = -(h +f 2 ) and s=(t 2 - t\), 

and remembering that the interval T over which the integrations extend is much larger 
than the duration over which the “memory” of the variable y lasts, we obtain 

l//o OO 

(a 2 ,) ~ 2/ 0 2 f £ K(s)[cos(2nnfos) + cos(4nnfoS)}dSds; (12) 

S=0s=-oo 

compare this to the steps that led us from equations (15.3.22) to (15.3.25) and (15.3.26). 
The second part of the integral in (12) vanishes on integration over S; the first part then 
gives 


OO 

(af,)=4/o J K(s)cos(27infos)ds. (13) 

o 

Comparing (13) with (10), we obtain the desired formula 

OO 

w(f) = 4 J K(s)cos(2nfs)ds. (14) 

o 

Taking the inverse of (14), we obtain 

OO 

K(s) = J w(f) cos(27r fs)df. (15) 

o 

For s = 0, formula (15) yields the important relationship 

OO 

K(0) = J w(f)df = {y 2 )-, (16) 

o 

see equation (9) as well as the definition of the autocorrelation function of the variable y, 
namely K(s) — (y(fi)y(fi + s)). Equations (14) and (15), which connect the complementary 
functions w{f) and K(s), constitute a theorem that goes after the names of Wiener (1930) 
and Khintchine (1934). 
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We shall now look at some special cases of the variable y(f) to illustrate the use of the 
Wiener-Khintchine theorem. 

Case 1 

If the given variable y(t) is extremely irregular, and hence unpredictable, then its correla¬ 
tion function K(s) would extend over a negligibly small range of the time interval s. 1T We 
may then write 

K(s) = cS(s). (17a) 


Equation (14) then gives 


w(f) — 2c for all /. (17b) 

A spectrum in which the distribution (of power) over different frequencies is uniform is 
known as a “flat” or a “white” spectrum. We note, however, that if the uniformity of distri¬ 
bution were literally true for all frequencies, from 0 to oo, then the integral in (16), which 
is identically equal to (y 2 ), would diverge! We, therefore, expect that, in any realistic sit¬ 
uation, the correlation function K(s) will not be as sharply peaked as in (17a). Typically, 
K{s ) will extend over a small range, O(cr), of the variable s, which in turn will define a 
“frequency zone,” with J'~ 0( 1/er), such that the function w(f) would undergo a change 
of character as / passes through this zone; toward lower frequencies, w(f) -»■ const. / 0, 
while toward higher frequencies, w(f ) -»■ const. = 0. One possible representation of this 
situation is shown in Figure 15.7 where we have taken, rather arbitrarily, 

sin(fls) 

K(s) = K( 0)--— (a > 0), (18a) 

as 

for which 

mf) = —K( 0) for f <y~ 

a 2n ( 18b ) 

„ „ a 

0 for / > —. 

J 2n 

In the limit a oo, equations (18) reduce to (17), with c = jra~ 1 K(0). 

Case 2 

On the other hand, if the variable y(f) is extremely regular, and hence predictable, then its 
correlation function would extend over large values of s; its power spectrum would then 
appear in the form of “peaks,” located at certain “characteristic frequencies” of the vari¬ 
able. In the simplest case of a monochromatic variable, with characteristic frequency/*, 
the correlation function would be 


K(s) = K{ 0) cos(2jt f*s), 


(19a) 


17 This is essentially true of the rapidly fluctuating force F( t) experienced by a Brownian particle due to the incessant 
molecular impulses received by it. 
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FIGURE 15.7 The autocorrelation function K(s) and the power distribution function w(f) of a given variable y(t)\ 
the parameter a appears in terms of an arbitrary unit of (time)" 1 . 


for which 


w(f) = K(Q)8(f —f*Y, (19b) 

see Figure 15.8. Avery special case arises when/* = 0; then, bothy(t) andfC(s) are constant 
in value, and the function w(f) is peaked at the d.c. frequency/ = 0. 

Case 3 

If the variable y(t) represents a signal that arises from, or has been filtered through, a lightly 
damped tuned circuit (a “narrowband” filter), then its power will be distributed over a 
“hump” around the mean frequency/*. The function K(s) will then appear in the nature of 
an “attenuated” function whose time scale, a, is determined by the width, A/, of the hump 
in the power spectrum. A situation of this kind is shown in Figure 15.9. 

The relevance of spectral analysis to the problem of the actual observation of a fluc¬ 
tuating variable is best brought out by examining the power spectrum of the velocity v(t) 
of a Brownian particle. Considering the x-component alone, the autocorrelation function 
K t , x (s), or simply K (s), in this case is given by 

K(s) = — 

M 


(t = MB)] 


( 20 ) 
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FIGURE 15.8 The autocorrelation function K(s) and the power distribution function w(f) of a monochromatic 
variable y(t), with characteristic frequency/*. 


see equation (15.6.10). The power spectrum w(f) is then given by the expression 


4 kT C 

w(f) = / e~ s ^ T cos(2itfs)ds - 

o 

which indeed satisfies the relationship 


4/cTr 


M 1 + (2?r/r) 2 ’ 


( 21 ) 


/ 2kT i 

w(f)df = — tan -1 (2ti fr) | 

o 

kT y 

= — = {vz), 

M x 


( 22 ) 


in agreement with the equipartition theorem (as applied to a single component of the 
velocity v). For/«;r -1 , the power distribution is practically independent of/, which 
implies a practically “white” spectrum, with 


w(f ) - = 4 BkT. 


(23) 


M 












15.5 Spectral analysis of fluctuations: the Wiener-Khintchine theorem 615 


K(s) 




FIGURE 15.9 The autocorrelation function K(s) and the power distribution function w(f ) of a variable that has 
been filtered through a lightly damped tuned circuit, with mean frequency/* and width A/~ (1/tr). 


We can then write for the velocity fluctuations in the frequency range (/,/+ A/), with 
f«r-\ 


- w (f) A f - (4BkT)Af. (24) 

In general, our measuring instrument (or the eye, in the case of a visual examination of 
the particle) has a finite response time r 0 , as a consequence ofwhichitis unable to respond 
to frequencies larger than, say, r^ 1 . The observed fluctuation is then given by the “pruned” 
expression 


l/TO 

/ 2kT / t \ 

w(f )df = — tam 1 f: 2n — J , (25) 

o 

instead of the “full” expression (22). In a typical case, the mass of the Brownian particle 
M~ 10 -12 g, its diameter 2a ~ 10 -4 cm, and the coefficient of viscosity of the fluid t] ~ 10 -2 
poise, so that the relaxation time r — M/(6jn]a) ~ 10“' seconds. However, the response 
time tq, in the case of visual observation, is of the order of 10 _1 s; clearly, r/to ~ 10“ 6 1. 
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Equation (25) then reduces to 18 


(Cdobs 


4/cT r kT 

-77— « T7 ’ 
M r 0 M 


(26) 


thus, in view of the finiteness of the response time to, the observed root-mean-square 
velocity of the Brownian particle will be down by a factor of 2(r/ro) 1/2 ~ 10 -3 ; numeri¬ 
cally, this takes us down from a root-mean-square velocity which, at room temperatures, 
is ~ 10 -1 cm/s to a value ~ 10 -4 cm/s. 

It is gratifying to note that the outcome of actual observations of Brownian particles is 
in complete agreement with the latter result; for a more detailed analysis of this question, 
see MacDonald (1950). The foregoing discussion highlights the fact that, in the process 
of observing a fluctuating variable, our measuring instrument picks up signals over only 
a limited range of frequencies (as determined by the response time of the instrument); 
signals belonging to higher frequencies are simply left out. 

The theory of this section can be readily applied to fluctuations in the motion of elec¬ 
trons in an (L, R) circuit. Corresponding to equations (21) through (24), we now have for 
fluctuations in the electric current I 


w(f) = 


4fc7Y 1 
L 1 + (2tt fr') 2 



(27) 


so that 


/ kT 

Mf)df=— = (I 2 ), (28) 

0 

in agreement with the equipartition theorem: {\LI 2 ) = \kT. For /<$: 1/r', equation (27) 
reduces to 

„ 4 kT 

w{f) — — , (29) 

K 

which, once again, implies “white” noise; accordingly, for low frequencies, 

AkT 

< a/ 2 >(/,/+a/) - “T/W - ( 30) 

Equivalently, we obtain for fluctuations in the voltage 


(A7 2 ), /i/+a/) ~ (4RkT)Af. (31) 

Equation (31) constitutes the so-called Nyquist theorem, which was first discovered empir¬ 
ically by Johnson (1927a,b; 1928) and was later derived by Nyquist (1927-1928) on the basis 

18 The fluctuations constituting this result belong entirely to the region of the "white" noise, with A/ = 1/ro; see 
equation (24), with B = r/M. 
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of an argument involving the second law of thermodynamics and the exchange of energy 
between two resistances in thermal equilibrium. 19 


15.6 The fluctuation-dissipation theorem 

In Section 15.3 we obtained a result of considerable importance, namely 


1 

B 


M 

T 


M 2 _ M 2 
6 kf L ~ 6 kT 


OO 

J K A (s)ds 

—OO 


1 

6 IcT 


OO 

J Kp(s)ds; 

— OO 


( 1 ) 


see equations (15.3.4), (15.3.26), and (15.3.28). Here, K A (s) and K F (s) are, respectively, the 
autocorrelation functions of the fluctuating acceleration .A(f) and the fluctuating force F(t) 
experienced by the Brownian particle: 

K a (s) = (A( 0) • A(s)) = ^ (F( 0) • F(s)) = ^ f (s). (2) 20 

Equation (1) establishes a fundamental relationship between the coefficient, 1/5, of the 
“averaged-out” part of the total force & (t) experienced by the Brownian particle due to the 
impacts of the fluid molecules and the statistical character of the “fluctuating” part, F(t), 
of that force; see Langevin’s equation (15.3.2). In other words, it relates the coefficient of 
viscosity of the fluid, which represents dissipative forces operating in the system, with the 
temporal character of the molecular fluctuations', the content of equation (1) is, therefore, 
referred to as a fluctuation-dissipation theorem. 

The most striking feature of this theorem is that it relates, in a fundamental manner, 
the fluctuations of a physical quantity pertaining to the equilibrium state of a given system 
to a dissipative process which, in practice, is realized only when the system is subject to an 
external force that drives it away from equilibrium. Consequently, it enables us to deter¬ 
mine the nonequilibrium properties of the given system on the basis of a knowledge of the 
thermal fluctuations occurring in the system when the system is in one of its equilibrium 

19 We note that the foregoing results are essentially equivalent to Einstein’s original result for charge fluctuations in a 
conductor, namely 

(8q 2 ),= 2j ft ; 

compare, as well, the Brownian-particle result: ( x 2 ) t = 2 BkTt. 

20 We note that the functions K A (s) and Kp f.s), which are nonzero only for s = 0(r*), see equation (15.3.21), may, for 
certain purposes, be written as 

ktOO = and K P (s) = ^fs(s). 


In this form, the functions are nonzero only for s = 0. 
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states'. For an expository account of the fluctuation-dissipation theorem, the reader may 
refer to Kubo (1966). 

At this stage we recall that in equation (15.3.11) we obtained a relationship between 
the diffusion coefficient D and the mobility B, namely D — BkT. Combining this with 
equation (1), we get 


OO 

1 If 

— = —-—=- / Kp(s)ds. (3) 

D 6(kT) 2 J 

— OO 

Now, the diffusion coefficient D can be related directly to the autocorrelation function 
K v (s) of the fluctuating variable v(t). For this, one starts with the observation that, by 
definition, 

t 

r{t) = J v{u)du, (4) 

o 

which gives 


L l 

{r 2 (t)) = J J (v(ui) ■ v(u 2 ))duidu 2 . 
o o 

Proceeding in the same manner as for the integral in equation (15.3.22), one obtains 

t/2 +2S t +2 (t-S) 

(r 2 (t)) = 


(5) 


it)) = J dS J K v (s)ds+ j dS J K v (s)ds; 


( 6 ) 


0 —2 S 


f/2 —2(t—S) 


compare this to equation (15.3.24). 

The function K v (s) can be determined by making use of expression (15.3.14) for v(t) 
and following exactly the same procedure as for determining the quantity {v 2 (t)), which is 
nothing but the maximal value, K v ( 0), of the desired function. Thus, one obtains 


K v (s) — 


compare these results to equation (15.3.27). It is easily seen that formulae (7)and (8) can 
be combined into a single one, namely 


v 2 (0)e -( 2 t+ s )/T + _ g-2»/rj 

for 

s > 0 

(7) 

v 2 (0)e~ {2t+s)/z + ^^e s/T (l - e ~ 2(t+s)/T ) 

for 

s < 0; 

(8) 


K v (s) = v 2 (0)e |S|/T + |-j^r - c 2 (0)J (e |s| / T -e (2f+s)A) for all s; 


(9) 




15.6 The fluctuation-dissipation theorem 619 


compare this to equation (15.3.29). In the case of a “stationary ensemble,” 

3 kT I.., 

K v {s)=—e-W\ ( 10 ) 

which is consistent with property (15.3.20). It should be noted that the time scale for the 
correlation function K v (s) is provided by the relaxation time r of the Brownian motion, 
which is many orders of magnitude larger than the characteristic time r* that provides the 
time scale for the correlation functions Ka(s) and Kp(s). 

It is now instructive to verify that the substitution of expression (10) into (6) leads to 
formula (15.3.7) for (r 2 ), while the substitution of the more general expression (9) leads to 
formula (15.3.31); see Problem 15.17. In either case, 


(r 2 )-► &Dt. 

t»T 


(ID 


In the same limit, equation (6) reduces to 

t oo oo 

(r 2 ) ~ J dS J K v (s)ds=t J K v (s)ds. 

0 — oo —oo 

Comparing the two results, we obtain the desired relationship: 

OO 

D= ^ J K v (s)ds. 

— OO 

In passing, we note, from equations (3) and (13), that 

OO OO 

J K v (s)ds J Kp(s)ds = (6fcT) 2 ; 


( 12 ) 


(13) 


(14) 


-OO —OO 


see also Problem 15.7. 

It is not surprising that the equations describing a fluctuation-dissipation theorem can 
be adapted to any situation that involves a dissipative mechanism. For instance, fluctua¬ 
tions in the motion of electrons in an electric resistor give rise to a “spontaneous” thermal 
e.m.f., which may be denoted as 53(1). In the spirit of the Langevin theory, this e.m.f. may 
be split into two parts: (i) an “averaged-out” part, —RI(t), which represents the resistive (or 
dissipative) aspect of the situation, and (ii) a “rapidly fluctuating” part, V{t), which, over 
long intervals of time, averages out to zero. The “spontaneous” current in the resistor is 
then given by the equation 


L—=-RI+V(ty, (V{t)) = 0. 


(15) 
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Comparing this with the Langevin equation (15.3.2) and pushing the analogy further, we 
infer that there exists a direct relationship between the resistance R and the temporal 
character of the fluctuations in the variable V(t). In view of equations (1) and (13), this 
relationship would be 

OO 

R= 6kT / (V(0) y(s)>& ( 16 ) 

— OO 

or, equivalently, 


1 R = mJ mmds - (17 > 

— OO 

A generalization of the foregoing result has been given by Kubo (1957, 1959); see, for 
instance, Problem 6.19 in Kubo (1965), or Section 23.2 of Wannier (1966). On generaliza¬ 
tion, the electric current density j(t ) is given by the expression 

t 

ji(t) = J2 f Ei(t')&u(t — f)dlf ( i,l = x,y,z ); (18) 

l -OO 

here, E(t) denotes the applied electric field while 

®ii{s) = ~ljmji(s)). (19) 

Clearly, the quantities kT<&n(s) are the components of the autocorrelation tensor of the 
fluctuating vector j(t). In particular, if we consider the static case E — (E, 0,0), we obtain 
for the conductivity of the system 


t OO 

ffxx= J -^= J <&xx{t- t')dt' = J <i>xx(s)ds 

-oo 0 

OO 

= / Ux(0)jx(s))ds, (20) 

— OO 

which may be compared with equation (17). If, on the other hand, we take E = 
(E cos cot , 0,0), we obtain instead 


OO 

ffxxM = 2 ^. r J Ux(0)jx(s))e- la>s ds. 

— OO 


( 21 ) 
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Taking the inverse of (21), we get 


OO 

kT C 

(jx(O)jx(s)) = — / Oxx( 0 J)e ta,s dax (22) 

—OO 

If we now assume that o xx (co ) does not depend on ® (and may, therefore, be denoted by the 
simpler symbol er), then 


(fc(0)/*(s)> = ( 2kTo)8(sy, (23) 

see footnote 20. A reference to equations (15.5.17) shows that, in the present approxima¬ 
tion, thermal fluctuations in the electric current are charaterized by a “white” noise. 

15.6.A Derivation of the fluctuation-dissipation theorem 
from linear response theory 

In this section we will show that the nonequilibrium response of a thermodynamic system 
to a small driving force is very generally related to the time-dependence of equilibrium 
fluctuations. In hindsight, this is not too surprising since natural fluctuations about the 
equilibrium state also induce small deviations of observables from their average val¬ 
ues. The response of the system to these natural fluctuations should be the same as the 
response of the system to deviations from the equilibrium state as caused by small driving 
forces; see Martin (1968), Forster (1975), and Mazenko (2006). 

Let us compute the time-dependent changes to an observable A caused by a small 
time-dependent external applied field h(t ) that couples linearly to some observable B. The 
Hamiltonian for the system then becomes 


H(t) =H 0 - h(t)B, (24) 

where Ho is the unperturbed Hamiltonian in the equilibrium state. Remarkably, the calcu¬ 
lation for determining the nonequilibrium response to the driving held is easiest using the 
quantum-mechanical density matrix approach developed in Section 5.1. The equilibrium 
density matrix is given by 


- _ exp(-/?H 0 ) 

Peq “ Tr(exp(-/JH 0 ))' 

where equilibrium averages involve traces over the density matrix: 


(25) 


<A) eq = Tr(AS eq ). ( 26 ) 

When the Hamiltonian includes a small time-dependent held h(t), then this additional 
term drives the system slightly out of equilibrium. We will assume that the held was zero 
in the distant past so the system was initially in the equilibrium state dehned by the 
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Hamiltonian Hq. We then turn on the field and measure the time-dependent deviations 
of the observable A from its equilibrium value. The small time-dependent applied field 
h(t ) induces a small change to the density matrix 


P(t) — p e q +8p(t). (27) 

The equation of motion of the density matrix in equation (5.1.10) gives 

I = w = s dm “ m 

where [, ] denotes the commutator. Since we are considering only the linear response of 
the system to the applied field, we have ignored the higher-order term proportional to 
h(t ) [B, S/o] . Solving (28) for the time-dependent change to the density matrix, we get 

<5/6(0 = J L f h (t')ex p ^ lH ° { * —— j [B,p eq ] exp t) J dt > (29) 

— OO 

This form uses the interaction representation in which operators evolve in time due to the 
unperturbed Hamiltonian H 0 . We can use the change in the density matrix at time t to 
calculate the change in the observable A compared to its equilibrium value, namely 


(SA(t)) = <A(f)> - (A) eq = Tr (Ap(t)) ~ Tr (Ap e q) = Tr (A8p(t)) . (30) 

Using the cyclic property of traces, Tr (QRS) — Tr (SQR), we find that {8A{t)) is the convolu¬ 
tion of a response function with the applied field: 


t 

(<«A(f))= l - f {[A(t),B{t')]) eOL h{t')dt'. 

— OO 


(31) 


Note that this nonequilibrium response function of the system to the driving force 
depends on the equilibrium average of the commutator of the observables A and B at dif¬ 
ferent times. The effect of the field on the observable A is causal since (8Ad)} depends 
only on the applied field at earlier times. Since the relation is linear, time-translationally 
invariant, and causal, the Fourier spectra of (8A) and h, namely 


"/ 


8A(a>) = / ( 8A(t )) e llot dt, and 


"/ 


h(a>) = / h(t)e lu>t dt, 


(32a) 


(32b) 


are related by 


8A(a>) = (x^fw) + «Xab(<w)) h(a >) 


( 33 ) 
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where x^g(«) is given by 

OO 

X> ) = ^/ HA(t),B(0)]) eq e ia>t dt. 

— OO 


(34) 


The quantity x^C&O is given by the Kramers-Kronig relation using the principal part P of 
an infinite integral over x^g(<w): 


OO 

X^b(m) = P J 

—OO 


da' 
a>' — a> tc 


(35) 


If A and B are the same operator, then x^fw) and x^(co) are, respectively, the real and 
imaginary parts of the response function. If A and B have the same symmetry under time- 
reversal, is real and is an even function of o>. For a set of observables A{, the set of 

response functions &>x-j(<w) form a symmetric positive matrix that gives the rate of energy 
dissipation due to the external driving forces. See fackson (1999) for a general causality 
discussion and Martin (1968), Forster (1975), or Mazenko (2006) for the details of this 
calculation. 

Now we consider the equilibrium temporal correlations between the fluctuations of the 
observables A and B, namely 


G AB (t-t') = (8A(t)8B(t')) eq . (36) 

At equal times, this measures the AB equilibrium fluctuations described in Section 15.1, 
that is, Gab(0) = ( SASB ) eq . The power spectrum of the AB equilibrium fluctuations is 
defined by 

OO OO 

Sab(co)= f GAB(t)e iMt dt= J {8A(t)8B(0)) e q e irat dt. (37) 

— OO —OO 

The similarity between the forms of Sab(oj) and x^g(a>) in equations (34) and (37) leads 
to an important relation between the power spectrum and the linear response function, 
namely the fluctuation-dissipation theorem: 

x'a B (oj) = i (l - SabIco). (38) 

The power spectrum Sab(co) measures equilibrium fluctuations whereas Xab(«) is propor¬ 
tional to the average rate of power dissipation that results from the time-varying applied 
held. The classical limit of the fluctuation-dissipation theorem is obtained by letting 
ha>/kT -» 0 with the result 

Xa H (oA = ^fSABico); (39) 

compare this to equation (15.3.45). More complete discussions of the fluctuation- 
dissipation theorem can be found in Martin (1968), Forster (1975), and Mazenko (2006). 
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15.6.B Inelastic scattering 

Inelastic scattering is an important experimental technique for measuring the dynamical 
behavior of materials. By measuring the intensity of radiation scattered from a sample as a 
function of wavevector transfer and frequency change relative to the incident monochro¬ 
matic radiation source, one can measure the spatio-temporal correlations in the material. 
This technique is now commonly applied to the scattering of neutrons, electrons, light, 
and x-rays. The frequency changes in the scattered wave are caused by inelastic scattering 
from quantum excitations in the sample; see Forster (1975), Squires (1997), and Mazenko 
(2006). 

The frequency-dependent scattering intensity is directly proportional to the dynamical 
structure factor 


S(k,a>) = 



-ik-(r t (t)-rH0)) 


e imt dt, 


(40) 


where 1c represents the wavevector transfer of the scattering process and a> represents the 
frequency difference from that of the incident beam. A positive value of co corresponds to 
a detected frequency that is less than the frequency of the incident beam. The dynamical 
structure factor S(k,co) encodes both the spatial and temporal equilibrium correlations of 
fluctuations in the material and can be decomposed into two terms, one that represents 
scattering from a single particle at different times and another that represents scattering 
from different particles: 


S(lc,a>) — S se if (fc, a>) + S cn herenl dr, (o ), 

1 °° / \ 

df (fc )W)= I J l^g-ikirM-rimyuot^ 

-nn ' i I 


Scoherent(fc,«) = ^ f fee-*™-'!™ )<?>**. 

-L \ ¥j 


(41a) 

(41b) 


(41c) 


The dynamical structure factor S(k,co) can also be written in terms of the spatio- 
temporal Fourier transforms of the time-dependent density n(r, t) to connect it to the 
power spectrum as defined in Section 15.6.A: 


OO 


S(k,a>) — Sab(o>) = J {8A(t)8B(0))e iwt dt, 

(42a) 

1 r ]_ 

8A{t)= —= / e~ lkr 8n(r,t)dr = 

viV J viV 

(42b) 

1C 1 

8B{ 0) = / e ikr 8n(r,0)dr= -—n k (0). 

\ N J v N 

(42c) 
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Positive frequency changes a>> 0 represent scattering events that create quantum exci¬ 
tations in the material with energy hco and are referred to as Stokes scattering. Negative 
frequency changes are called anti-Stokes scattering and correspond to scattering events 
that destroy an excitation in the material with energy hco. Since an excitation must first 
exist in order for it to be destroyed, the anti-Stokes scattering rate in equilibrium is lower 
relative to the Stokes scattering rate by the Boltzmann factor for the excitation: 


S(k,-a)) = e- f>tUo S(k,co). 


(43) 


This relation is sometimes used in Raman scattering to measure the temperature of the 
sample. The heights of Stokes and anti-Stokes peaks are symmetric if the excitation ener¬ 
gies are small compared to thermal energies, that is hco <<: kT. The static structure factor 
S(1c) in equation (10.7.18) is obtained by integrating S(k,o>) over all o>: 


OO 



(44) 


—OO 


This singles out equal-time scattering events and corresponds to quasielastic scattering 
measurements that are unable to resolve the energy changes due to the excitations in the 
material. 

Three commonly measured types of inelastic laser scattering are: Raman scattering, 
Brillouin scattering, and Rayleigh scattering. Raman scattering measures electronic, vibra¬ 
tional, and rotational excitations of atoms and molecules, electronic band structure, and 
optical phonon modes. Brillouin scattering measures long-wavelength acoustic sound 
modes. The widths of Raman and Brillouin scattering peaks are determined by the life¬ 
times of their respective modes. Rayleigh scattering measures the heat diffusion mode 
centered at co — 0 with width proportional to the thermal diffusivity. The wavelength of 
visible light is large compared to atomic scales, so the wavevector transfers possible with 
light scattering are very small compared to the size of the Brillouin zone. This limita¬ 
tion is removed for inelastic x-ray and neutron scattering where experiments can probe 
wavevectors away from the center of the Brillouin zone. 

The dynamical scattering of a laser beam from a liquid includes three peaks: a Rayleigh 
peak centered at to — 0 due to scattering from the thermal fluctuations in the liquid and 
the Stokes and anti-Stokes Brillouin peaks at co^ ^ ±ck due to scattering from acoustic 
phonons with sound speed c. For scattering in this wavevector and frequency range, the 
dynamical structure factor is symmetric in co since hco << kT. An early Brillouin scatttering 
measurement of the dynamical structure factor of a liquid is shown in Figure 15.10. 

The fluctuation-dissipation theorem enables one to develop a theory of the dynamical 
structure factor based on the hydrodynamic response of a system that is weakly perturbed 
from equilibrium. In the case of a fluid, small perturbations in pressure and tempera¬ 
ture result in propagating sound waves and thermal diffusion. This results in the following 
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FIGURE 15.10 Dynamical structure factor for carbon tetrachloride using 632.8 nm He-Ne laser with a 90° scattering 
angle. The structure factor is symmetric since hoi <sc kT. The Landau-Placzek ratio of the integrated intensity under 
the Rayleigh peak to the integrated intensities under the two Brillouin peaks is related to ratio of the constant- 
pressure heat capacity to the constant-volume heat capacity: Ir/(2Ib) = Cp/Cv — 1 = 0.72±0.03, Landau and 
Placzek (1934). The locations of the Brillouin peaks give the sound speed and the widths of the peaks measure 
the thermal diffusivity and sound attenuation coefficient of the liquid; see equation (45). From Cummins and 
Gammon (1966), reprinted with permission; copyright © 1966, American Institute of Physics. 


theoretical form for the dynamical structure factor: 




(45) 


The parameters in equation (45) are the thermal diffusivity Dp, the sound speed c, and the 
sound attenuation coefficient T, while y — Cp/Cy is the ratio of the constant-pressure and 
constant-volume heat capacities; see Forster (1975) and Hansen and McDonald (1986). 


15.7 The Onsager relations 


Most physical phenomena exhibit a kind of symmetry, sometimes referred to as reci¬ 
procity, that arises from certain basic properties of the microscopic processes that operate 
behind the (observable) macroscopic situations. A notable example of this is met with 
in the thermodynamics of irreversible processes where one deals with a variety of flow 
processes, such as heat flow, electric current, mass transfer, and so on. These flows (or 
“currents”) are driven by “forces,” such as a temperature difference, a potential difference, 
a pressure difference, and so on, which come into play because of a natural tendency 
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among physical systems which happen to be out of equilibrium to approach a state 
of equilibrium. If the given state of the system is not too far removed from a state of 
equilibrium, then one might assume a linear relationship between the forces Xj and the 
currents jq: 


X'i — YijXj, (1) 

where Yij are the kinetic coefficients of the system. 21 Simple examples of such coefficients 
are thermal conductivity, electrical conductivity, diffusion coefficient, and so on. There 
are, however, nondiagonal elements, Yijd f f)< as well that may or may not vanish; they are 
responsible for the so-called cross effects. It is the symmetry properties of the matrix (yij) 
that form the subject matter of this section. 

The most obvious way to approach this problem is to consider the entropy, S(Xj), of 
the system in the disturbed state relative to its maximal value, S(jq), in the relevant state of 
equilibrium. It is the natural tendency of the function S(x ; ) to approach its maximal value 
S(jq) that brings into play the driving forces Xj ; these forces give rise to currents x j} which 
take the “coordinates” x,- toward their equilibrium values jq. If the deviations (jq- — jq) are 
small, then the function S(x ; ) may be expressed as a Taylor series around the values x,- = jq; 
retaining terms up to the second order only, we have 


S(Xi) = S(Xi) + 



(Xi - Xj) 


1 / d 2 S 

2 l dxtfXj 


(Xi - Xj)(Xj - Xj). 

x i,j= x i,j 


( 2 ) 


In view of the fact that the function S(x,-) is maximum at x,- = jq, its first derivatives vanish; 
we may, therefore, write 


AS = S(x,-) - S(xb = 


1 

-Pij(Xi - Xi)(Xj - Xj), 


(3) 


where 


N = 


d 2 S 

dXidXj 



(4) 


The driving forces X\ may be defined in the spirit of the second law of thermodynamics, 
that is, 


Xi = 



fiij(Xj Xj). 


(5) 


21 In writing equation (1), and other subsequent equations, we follow the summation convention that implies an 
automatic summation over a repeated index. 
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We note that in the present approximation the forces Xj depend linearly on the dis¬ 
placements (x/ — x/); in the state of equilibrium, they vanish. Now, in view of equation 
(15.1.2), the ensemble average of the product XiXj is given by 

S-ooiXiXj) exp j -^PijlXi - xi){Xj - xj )} n dxi 

<■ xtXj) =- r— -i- l - -; (6) 

f-oc ex P -JicPij&i - x j)( x j - x jl n dx i 

1 J i 

the limits of integration in (6) have been extended to — oo and +oo because the integrals 
here do not draw any significant contribution from large values of the variables involved. 
In the same way, 


S-o o x i exp {- 21 Pi] - Xi ) (Xj -*/)} n dx i 

(Xi) = - 7— - r J - = Xi. (7) 

/-oo ex P ~JkPij( x i - x h ( x j - x jl n dx i 

1 J i 

Differentiating (7) with respect to xj (and remembering that the integral in the denomina¬ 
tor is a constant, independent of the actual values of the quantities x,), and comparing the 
resulting expression with (6), we obtain the remarkable result 

(XiXj) = -kSij. (8) 

We now proceed toward the key point of the argument. First of all, we note that, 
though equations (1) are concerned with irreversible phenomena, the microscopic pro¬ 
cesses underlying these phenomena obey time reversal, which means that the temporal 
correlations of the relevant variables are the same whether measured forward or backward 
in time. Thus, 


(X;(0)X/(S)) = (x,-(0)x/(-s)); 


(9) 


also, by a shift in the zero of time, 


(Xi(O)Xj(-S)) = (Xi(s)Xj( 0)). 


( 10 ) 


Combining (9) and (10), we get 


<x,(0)*y(s)> - (x,;(s)Xj(0)). (11) 

If we now subtract, from both sides of this equation, the quantity (x/(0)Xj(0)), divide the 
resulting equation by 5 and let 5 0, we obtain 


(Xi(0)3ty(0» = (Xi(O)Xj(O)). 


(12) 
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Substituting from (1) and making use of (8), we obtain on the left side of (12) 


(Xi(O)yjiXi(O)) — kyjiSji — kyji 


and on its right side 


(YuXi(O)Xj(Q)) — kyuSji — kyij. 


It follows that 


Yij ~ Yji- 


(13) 


Equations (13) constitute the Onsager reciprocity relations ; they were first derived by 
Onsager in 1931 and have become an essential part of the thermodynamics of irreversible 
phenomena. 

In view of equations (1) and (13), the currents x,- may be written as 


i. = V 
‘ dXi’ 

where the generating function f is a quadratic function of the forces Xy. 


(14) 


/= 2 ///*/*/• 


(15) 


The function / is especially important in that it determines directly the rate at which the 
entropy of the system changes with time: 


s -^ Xi - x ‘ x ‘~ x ‘/xr 2f - 


(16) 


As the system approaches the state of equilibrium, its entropy must increase toward the 
equilibrium value S( jq). The function / must, therefore, be positive definite, which places 
certain restrictions on the coefficients y^. 

Analogous to equation (1), we could also write 


Xi — C ij (Xj Xj ) , (17) 

the quantities ijij being another set of coefficients pertaining to the system. From equa¬ 
tions (1) and (5), on the other hand, we obtain 


Xi — fiijXj — fiij(yjiXi) — X L jyji { fii m (x m x m )} 

— PijYjlfilm( x m ~ %)■ (18) 

Comparing (17) and (18), we obtain a relationship between the new coefficients Oj and the 
kinetic coefficients yy. 


Him — PijYjlPlm- 


( 19 ) 
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Now, in view of the symmetry properties of the matrices and y, we get 


Cim — (mi> 


(20) 


thus, the coefficients Cj> introduced through the phenomenological equations (17), also 
obey the reciprocity relations. It then follows that the quantities X;, see equation (14), may 
be written as, 



where 

/' = ^ij(Xi-Xi)(Xj-Xj). 

The entropy change dS may now be written as 


35 

dS = — dx; = X;dx; = —Pji(Xi — xpdxj 


dXj 


= {Xj - Xi)d{-Pij(Xj - Xj)} = ( Xi - xpdXi, 


( 21 ) 


( 22 ) 


( 23 ) 


so that 

dS 

„„ =(Xi-Xi); ( 24 ) 

oXi 

clearly, the entropy S is now regarded as an explicit function of the forces X, (rather than of 
the coordinates jCj). The time derivative of S now takes the form 

S = ^= ( Xi - ~ Xi ) =2f. ( 25 ) 

3 Aj oXi 

Comparing (16) and (25), we conclude that the functions / and/' are, in fact, the same; 
they are only expressed in terms of two different sets of variables. 

It seems important to mention here that Onsager’s reciprocity relations have an inti¬ 
mate connection with the fluctuation-dissipation theorem of the preceding section. Fol¬ 
lowing equations (15.6.18) and (15.6.19), and adopting the summation convention, we 
have in the present context 


t 

Xiit) = kT J Ei(t')(xi(l!)Xi(t))dt' 

—OO 


( 26 ) 


or, setting (t - t') = s, 


Xi(t) = 


1 

kT 



o 


S)(x/(t - s)(x { (t))ds; 


( 27 ) 
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compare this to equation (1). Interchanging the indices i and l, we obtain 


OO 



(28) 


o 


The crucial point now is that the correlation functions appearing in equations (27) and 
(28) are identical in value, for 


(X,(t - s)Xi(t)) = (xi(O)Xi(s)) = (xi(O)Xi(-s)) = ( ki(t)Xi(t-s )); 


(29) 


in establishing (29), the first and third steps followed from “a shift in time” while the sec¬ 
ond step followed from the “principle of dynamical reversibility of microscopic processes.” 
The equivalence depicted in equation (29) is, in essence, the content of Onsager’s reci¬ 
procity relations. In particular, if the correlation functions appearing in (27) and (28) are 
sharply peaked at the value 5 = 0, then these equations reduce to the phenomenological 
equations (1), and equation (29) becomes synonymous with the Onsager relations (13). 

In the end, we make some further remarks concerning relations (13). We recall that, in 
arriving at these relations, we had to make an appeal to the invariance of the microscopic 
processes under time reversal. The situation is somewhat different in the case of a “system 
in rotation” (or a “system in an external magnetic field”), for then the invariance under 
time reversal holds only if there is also a simultaneous change of sign of the angular veloc¬ 
ity S (or of the magnetic field B). The kinetic coefficients, which in this case might depend 
on the parameter (or B), will now satisfy the relations 


Yij(tt) = Yji(-&) 


(13a) 


and 


(13b) 


Yij(B) = Yji(-B). 


In addition, our proof here rested on the implicit assumption that the quantities x* 
themselves do not change under time reversal. If, for some reason, these quantities 
are proportional to the velocities of a certain macroscopic motion, then they will also 
change their sign under time reversal. Now, if both x t and Xj belong to this category, then 
equation (12), which is crucial to our proof, would remain unaltered; consequently, the 
coefficients Yij and yji would still be equal. However, if only one of them belongs to this 
category while the other one does not, then equation (12) would change to 


(Xi(O)Xj(O)) = -(xKO)xy(O)); 


( 12 ') 


the coefficients yy and yji would then obey the relations 


Yij --Yji- 


( 13 ') 
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For the application of Onsager’s relations to different physical problems, reference may be 
made to the monographs by de Groot (1951), de Groot and Mazur (1962), and Prigogine 
(1967). 


Problems 


15.1. Making use of expressions (15.1.11) and (15.1.12) for AS and A P, and expressions (15.1.14) for 
(AT) 2 ,(AV) 2 , and (ATAV), show that 

(a) (ATAS) = JcT; (b) (APAV) = -kT ; 

(c) (ASAV) = kT(dV/dT)p; (d) (APAT) = kT 2 Cy CdP/dT) v . 

[Note that results (a) and (b) give: (ATAS - APAV) = 2IcT, which follows directly from the 
probability distribution function (15.1.8).] 

15.2. Establish the probability distribution (15.1.15), which leads to the expressions in (15.1.16) for 
(AS) 2 , (A P) 2 , and (ASAP). Show that these results can also be obtained by following the 
procedure of the preceding problem. 

15.3. If we choose the quantities E and V as “independent” variables, then the probability distribution 
function (15.1.8) does not reduce to a form as simple as (15.1.13) or (15.1.15); it is marked instead 
by the presence, in the exponent, of a cross term proportional to the product AEAV. 
Consequently, the variables E and V are not statistically independent-. (AEAV) ^ 0. 

Show that 


(AEAV) = kT 




verify as well expressions (15.1.14) and (15.1.18) for (AV) 2 and (A E) 2 . 

[Note that in the case of a two-dimensional normal distribution, namely 


p(x,y) ex exp 


1 

2 


(ax 2 + 2 bxy + cy 2 ) 


the quantities (x 2 ),(xy), and (y 2 ) can be obtained in a straightforward manner by carrying out a 
logarithmic differentiation of the formula 


// 


exp 


1 

2 


(ax 2 + 2 bxy + cy 2 


dxdy = 


2tt 

s/ (ac - b 2 ) 


with respect to the parameters a, b, and c. This leads to the covariance matrix of the distribution, 
namely 


If b = 0, then 


((x 2 ) {xy)\ = 1 ( c -b\ 

\(yx) (y 2 )J (flc-h 2 )V-& a)' 

(x 2 ) = 1 /a, (xy) = 0, (y 2 ) = 1/c.] 22 


15.4. A string of length l is stretched, under a constant tension F, between two fixed points A and B. 
Show that the mean square (fluctuational) displacement y(x) at point P, distant x from A, is 
given by 


{y(x)} 2 = —x(l-x). 

22 For the covariance matrix of an n-dimensional normal distribution, see Landau and Lifshitz (1958), Section 110. 
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Further show that, for X 2 > xi, 


y(xi)y(x 2 ) = —xi (l-x 2 ). 

[Hint: Calculate the energy, O, associated with the fluctuation in question; the desired 
probability distribution is then given by p oc exp(—O /kT), from which the required averages can 
be readily evaluated.] 

15.5. How small must the volume, Va, of a gaseous subsystem (at normal temperature and pressure) be, 
so that the root-mean-square deviation in the number, Na, of particles occupying this volume be 

1 percent of the mean value Na ? 

15.6. Pospisil (1927) observed the Brownian motion of soot particles, of radii 0.4 x 10 4 cm, immersed 
in a water-glycerine solution, of viscosity 0.0278 poise at a temperature of 18.8°C. The observed 
value of x 2 , in a 10-second time interval, was 3.3 x 10 "cm 2 . Making use of these data, determine 
the Boltzmann constant k. 

15.7. In the notation of Section 15.3, show that for a Brownian particle 

{v(t) ■ F(t )) = 3kT/z, while (v(t) ■ F{t)) = 0. 

On the other hand, 

(r(f) • 3? (f)) = —3kT, while (r(f) ■ F(f)) = 0. 

15.8. Integrate equation (15.3.14) to obtain 

t 

r(t ) = i>(0)r(l - e~ t/T ) + z J {1 - e (u ~ t)lx }A{u)du, 

0 

so that r(0) = 0. Taking the square of this expression and making use of the autocorrelation 
function^ (s), derive formula (15.3.31) for (r 2 (f)). 

15.9. While detecting a very feeble current with the help of a moving-coil galvanometer, one must 
ensure that an observed deflection is not just a stray kick arising from the Brownian motion of the 
suspended system. If we agree that a deflection 9, whose magnitude exceeds 40 r . ms . [= 4(A.T/c) 1 / 2 ], 
is highly unlikely to be due to the Brownian motion, we obtain a lower limit to the magnitude of the 
current that can be reliably recorded with the help of the given galvanometer. Express this limiting 
current in terms of the time period r and the critical damping resistance R, of the galvanometer. 

15.10. (a) Integrate Langevin’s equation (15.3.5) for the velocity component v x over a small interval of 

time St, and show that 

(Sv x ) v x ((Sn x ) 2 ) 2 kT 

—— = and — = ——. 

St r St Mr 

(b) Now, set up the Fokker-Planck equation for the distribution function f(v x , t) and, making use 
of the foregoing results for /j.\ ( v x ) and fi 2 (v x ), derive an explicit expression for this function. 
Study the various cases of interest, especially the one for which t » z. 

15.11. Generalize the analysis of the Langevin theory of a harmonic oscillator, as given by equation 
(15.3.33), to the case of an oscillator starting at time t = 0 with the initial position x(0) and 
the initial velocity f(0). Derive, for this system, the quantities (x 2 (f)) and (iPtf)) and show that, 
in the limit wo - 5 - 0, these expressions reproduce equations (15.3.29) and (15.3.31) while, in the 
limit M ->• 0, they lead to the relevant results of Section 15.4. 

15.12. Generalize the Fokker-Planck equation to the case of a particle executing Brownian motion in 
three dimensions. Determine the general solution of this equation and study its important 
features. 

15.13. The autocorrelation function K(s) of a certain statistically stationaiy variable y(t) is given by 

(a) K(s) = K(0)e~ a ^ cos(2 nf*s) 
or by 

(b) K(s) = F(0)e-“ |s| cos ( 2 t rf*s), 
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where a > 0. Determine, and discuss the nature of, the power spectrum w(f) in each of these cases 
and investigate its behavior in the limits (a) a -*■ 0, (b) f* -> 0, and (c) both a and/* -> 0. 

15.14. Show that if the autocorrelation function K(s) of a certain statistically stationary variable y(t) is 
given by 

sin(fli) sin (bs) 

K(s) = K(0) - - - (a>b> 0), 


as bs 

then the power spectrum w(f) of that variable is given by 


w(f) = ^K( 0) 
ab 


0 


for 0 < / < 


a-b 
2 n ’ 


-ttf 


, a-b a+b 

for -< f <- 

2tt ~ J ~ 2n 

, a+b t 

lor -< t < oo. 

2n ~ J 


Verify that the function w(f) satisfies the requirement (15.5.16). 

[Note that, in the limit b 0, we recover the situation pertaining to equations (15.5.18).] 
15.15. (a) Show that the mean square value of the variable F(f), defined by the formula 


U-\~L 

Y(t)= J y(u)du, 


where y(u) is a statistically stationary variable with power spectrum w(f), is given by 

OO 

w{f) 


(Y (t)) = 


*{■ 


2 ,1 - cos(2nft)} df] 


and, accordingly, 


OO 

7: 


w(f) = 4nf j -^j(Y 2 (t)) sin(2jrft)dt 


= 2 


OO 

J 


■^(Y 2 (t))cos(2nft)dt. 


For details, see MacDonald (1962), Section 2.2.1. A comparison of the last result with equation 
(15.5.14) suggests that 

1 3 2 - 

K y (s) = 2a? <y(s)> - 

(b) Apply the foregoing analysis to the motion of a Brownian particle, taking y to be the velocity of 
the particle and Y its displacement. 

15.16. Show that the power spectra w v (f) and Wa (J) of the fluctuating variables v(t) and A(t) that appear 
in the Langevin equation (15.3.5) are connected by the relation 


w v (f) = w A (f) 


1 + (2nfr) z ’ 


r being the relaxation time of the problem. Hence, by equation (15.5.21) w A (f) = 12 kT/Mr. 
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15.17. (a) Verify equations (15.6.7) through (15.6.9). 

(b) Substituting expression (15.6.9) for K v (s) into equation (15.6.6), derive formula (15.3.31) for 
(r 2 (f)). 

15.18. Determine y", x U’>) and S vx {<d) for a Brownian particle in a harmonic oscillator potential. Show that 
the response function and the power spectrum for this case are related by the classical limit of the 
fluctuation-dissipation theorem. 

15.19. Derive the linear response density matrix (15.6.29) from the equation of motion (15.6.28). 

15.20. Show that GabM = GbaU — ipfr) and use the cyclic property of the traces to derive the 
fluctuation-dissipation theorem x^ B (cu) = ^ (1 — e~P ho> ') Sab(o>)- 

15.21. Show that GabU) = GbaV — ifih). Use this result to show that, in the classical limit, x^g(f) becomes 

(^j^B(O)j. Further show that this leads to equation (15.6.39). 

15.22. Determine the self-diffusion term in the dynamical structure factor S se if(fc, o>) in 
equation (15.6.41b) for the case of a single particle that diffuses according to the diffusion 
equation. Assume the process to be Gaussian for which (e?) = exp ((f 2 /2)). 

15.23. Determine the angular frequency for the Brillouin peaks in water for 90° laser scattering, using 
a He-Ne laser with k = 632.8 nm. Determine the width of the Rayleigh peak and show that the 
Brillouin peaks are well-separated from the Rayleigh peak. The thermal diffusivity of water is 

Dj = 1.4 x 10 -7 m 2 /s. 

15.24. Describe the dynamical structure factor for Raman scattering for a He-Ne laser with k = 632.8nm. 
The energy level responsible for this scattering has an energy of 0.05 eV and the lifetime of this 
state is 1 picosecond. Are the Stokes/anti-Stokes scatterings symmetric as w -»• —id at room 
temperature? 



Computer Simulations 


Computer simulations play an important role in modern statistical mechanics. The history 
of the use of computer simulations in science parallels the history of early digital comput¬ 
ing. The people and places involved centered around Los Alamos and other U.S. national 
laboratories where the first digital computers became available for use by scientists after 
World War II. Early leaders in the development of computer simulation methods included 
Fermi, Ulam, von Neumann, Teller, Metropolis, Rosenbluth, and others who were also 
involved in the Manhattan Project (Metropolis, 1987). 

Computer simulations in statistical mechanics fall into two broad classes: Monte Carlo 
(MC) and molecular dynamics (MD), although variants span the range between the two. 
Both methods involve numerically evolving simple models of materials through a set of 
microstates in order to determine the thermodynamic averages of measurable quantities. 
Computer simulations provide a means to study physical systems that is complementary 
to both experiment and theory. The following are a few of the advantages of computer 
simulations: 

• Computer simulations can provide insight into the equilibrium and nonequilibrium 
behavior of model systems for ranges of parameters where theoretical approximations 
are invalid or untested. 

• Computer simulations provide a means to test the range of validity of theoretical 
approximations against specific model systems. 

• Computer simulations allow visualization of physical processes that can provide new 
insights into complex phenomena. 

• Computer simulations allow detailed examination of behaviors that might not be 
accessible experimentally. 

• Computer simulations can be used to examine fundamental physical processes that 
can be used to guide theory. 

• Computer simulations can be used to model systems that do not exist in nature to 
provide assistance in understanding existing materials and engineering new ones. 


16.1 Introduction and statistics 

While certain critical aspects of computer simulation theory should be followed rigorously, 
much of computer simulation development and use is an art form. There are many possi¬ 
ble simulation approaches for any given problem and some choices will be more effective 
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at elucidating important physical properties than others. This brief chapter concentrates 
on equilibrium simulations but computer simulations are also widely used to model 
dynamical and nonequilibrium processes. The task of determining equilibrium thermody¬ 
namic averages of model systems is accomplished by generating a sequence of microstates 
that are chosen from the equilibrium ensemble of the model. For example, an MD simula¬ 
tion might be used to integrate Newton’s equations of motion for generating a time-series 
of states in phase space as the system explores the constant-energy hypersurface of the 
Hamiltonian. By comparison, an MC simulation of the same model might generate a 
sequence of states chosen by a random walk among the configurational microstates of the 
canonical ensemble. Both methods are examples of importance sampling, which focuses 
computational effort on generating microstates that are representative of the equilibrium 
ensemble rather than sampling all of the phase space. It is this huge improvement in effi¬ 
ciency that makes computer simulations of statistical mechanical models feasible. The 
sequence of states produced by either method can be used to estimate equilibrium aver¬ 
ages. Allen and Tildesley (1990), Binder and Heermann (2002), Frenkel and Smit (2002), 
and Landau and Binder (2009) provide more detailed discussions of computer simulations 
and their applications in statistical physics. 

Let q represent a microstate of the system and A(q) a thermodynamic observable that 
is a function of the microstate. In an MC simulation, q might represent the positions of 
all the particles in the system while in an MD simulation q might represent the posi¬ 
tions and momenta of all the particles. The observable A(q) might represent the potential 
energy, virial contribution to the pressure, pair correlation function, and so on. The initial 
microstate chosen to start a simulation will generally not be typical of the set of microstates 
that make up the equilibrium ensemble, but the goal of a simulation is to evolve the 
microstate through a large enough subset of the microstates of the equilibrium ensemble 
so that averages of observables approach their equilibrium values. After a simulation has 
run long enough for the system to approach equilibrium, the simulation then generates a 
sequence of M configurations, {qj)^L A , chosen from the set of microstates in the equilib¬ 
rium ensemble and stores a sequence of values, {A(qj')}j ^ ] , for each of the thermodynamic 
variables one wants to measure. 1 Since the microstates are chosen from the equilibrium 
ensemble, the equilibrium average of A is approximated by a simple average of the set of 
values {A(qf)}j ^ } . Of course, a simulation can only provide a finite sequence of states, so a 
statistical analysis of the uncertainty of the results is a crucial part of any simulation. 

The equilibrium average of the variable A is given by 


{A) - (A) m ± a M 


( 1 ) 


'Alternatively, one can store the full configuration set of statistically independent microstates for later analysis. This 
tactic requires a large amount of storage space but is useful if there is a large computational cost of generating statistically 
independent configurations and one needs to calculate averages of many different observables at a later time using the 
stored configurations. This is sometimes done in large-scale lattice quantum chromodynamics simulations. 
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where the simulation average ( A) M and uncertainty ctm are determined by 


1 M 

< a > m = 

j= i 

(2a) 

V ( A2 )m - 

M VM/( 2r + l) ’ 

1 M 

( a2 ) M - 

(2b) 


(2c) 


The “correlation time’’ r is defined as follows. Since the states qj are generated sequentially 
by the simulation, each new state qj +1 is guaranteed to be close to the previous state qj, so 
the values A(qj) in the sequence are highly correlated. The correlations in the values of 
Aiqf) decrease with the “correlation time” r which can be calculated from the correlation 
function (PaaW, namely 


(A(t)A(O)) - ( A{t )) (A(0)> 
= - 

r = 

t> 0 


(3a) 

(3b) 


The variable t is a measure of the separation between pairs of configurations in the ordered 
sequence. In the case of molecular dynamics simulations, r represents a physical time 
for the system to move far enough along its trajectory on the energy surface to result in 
decorrelated values of A. Monte Carlo simulations explore equilibrium microstates in a 
random walk, so r does not correspond to physical time but rather the average number 
of Monte Carlo sweeps needed to give statistically independent values for A. The quan¬ 
tity M/(2r + 1) represents the number of statistically independent configurations in the 
sequence of M values. 2 

2 The correlations 0 a 4(O in equation (16.1.3a) can be measured using subsequences of the M configurations: 

X 

(A(t)A( 0)) * — J2 A ^j+t)A{qj), 

]=i 


and 

1 M ' 

A(t)) * 

i= i 

By definition, the correlation function 4>aa(0) is unity and the correlations decay to zero as t -v oo. Once one can place 
a reliable upper bound on the size of the correlation time r for a given system from a knowledge of its equilibrium 
correlations, one can simply skip more than r configurations between storing values of A(qj) to ensure that the numbers 
in the sequence are now approximately statistically independent. 
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16.2 Monte Carlo simulations 

The term Monte Carlo method, named for the gambling casinos in Monaco, was coined 
by Nicholas Metropolis (1987) - “a suggestion not unrelated to the fact that Stan [Ulam] 
had an uncle who would borrow money from relatives because he ‘just had to go to Monte 
Carlo’.” The goal of a Monte Carlo simulation in equilibrium statistical mechanics is to 
use pseudorandom numbers to draw a representative sample of microstates {q} from the 
equilibrium probability distribution 


Peq(q) — 


exp (-pE(q)) 
Hq' exp [-pE(q')\ 


( 1 ) 


This means that “instead of choosing configurations randomly, then weighting them with 
exp (—E/kT), we choose configurations with a probability exp (—E/kT) and weight them 
evenly” (Metropolis et al., 1953). If a simulation can accomplish this, then thermodynamic 
averages can be calculated using the simple averages in equation (16.1.2). This importance 
sampling of the states provides a huge computational advantage over normal random 
sampling. 

The following algorithm accomplishes the goal of randomly selecting microstates q 
from the set of all microstates with a probability distribution that approaches the equilib¬ 
rium distribution (1) (Metropolis et al., 1953; Kalos and Whitlock, 1986; Allen and Tildesley, 
1990; Frenkel and Smit, 2002; Binder and Heermann, 2002; Landau and Binder, 2009). Con¬ 
sider an ensemble of microstates that has some initial distribution of probabilities P{q, 0) 
and let the distribution evolve according to the discrete stochastic rate equation 


P(q,t+ 1) = P(q, t) + p (qt)W(q' -+q)~ P(q, t)^W(q -> q'), 

q' q' 


( 2 ) 


where W(q ->• a[) is the transition rate from state q to state c(. If the transition rate obeys 
the balance condition 


J^Peqiq'lWiq 1 -> q) = P eq (q)J2 W( q <?')> (3) 

q 1 q' 

and the random process in equation (2) can reach every microstate from every other 
microstate in a finite number of steps, then the ensemble probability will approach the 
equilibrium distribution: 


lim P(q,t) = P eq (q). (4) 

£—>-oo 

In practice, equation (3) is usually implemented using the detailed balance condition 

P eq (q)W{q q') = P eq (q')W(q' -* q). (5) 

Evaluating P e q(q) requires summing over all states to determine the partition function, 
but the ratio P e q(q')/Peq(q) depends only on the energy difference A E = E(q') - E(q). 
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Therefore, the transition rates are related by 

W(q -* q') = exp(— /3AE) W(q r -> q). (6) 

This guarantees that the sequence of states generated by this stochastic process, begin¬ 
ning from any starting configuration, asymptotically becomes equivalent to selecting 
states by a random walk among the microstates of the equilibrium ensemble. This can 
be implemented in a computer code, as first proposed by Metropolis et al. (1953), using 
the transition rates 


W(q-^cj)= 1 ifAf?<0, 

W{q -*•</) = exp(—jSAf?) if A E > 0. (7) 

Other choices for the transition rates are possible but this form, named after Metropolis, is 
one of the most commonly used. 

16.2.A Metropolis Monte Carlo algorithm 

The Metropolis method can be implemented in a computer program by using a pseudo¬ 
random number generator rand() that returns pseudorandom numbers that are uniformly 
distributed on the open unit interval (0.0,1.0); see Appendix I for a discussion of how 
pseudorandom numbers are generated. First, initialize the system by choosing a starting 
state qo from the set of all microstates of the model. It is helpful if qo is not atypical of 
the states in the equilibrium ensemble. This reduces the number of steps needed for the 
system to equilibrate. For example, a disordered liquid-like state would not be the best 
starting point for a simulation of a crystalline solid. 

The Metropolis algorithm is defined by the following steps: 

1. Generate a random trial state q tIia i that is “nearby” the current state qj of the system. 
“Nearby” here means that the trial state should be almost identical to the current state 
except for a small random change made, usually, to a single particle or spin. For 
example, one can create a trial state of a particle simulation by randomly moving one 
particle to a nearby location 


xf ial = x t + Ax(rand() - 0.5), (8) 

with two more calls to rand() to generate j/} rml and zj. The trial state of a spin system 
usually involves a spin flip or a random rotation of a single spin. 3 

3 There are Monte Carlo algorithms for spin systems that flip spins in large correlated clusters rather than one spin at a 
time; see Swendsen and Wang (1987) and Wolff (1989). These methods are very effective for simulations of some particu¬ 
lar models. Also, one can attempt spin flips of all the spins on noninteracting sublattices at one time since the acceptance 
of each flip is independent of the other flipped spins. For example, a chessboard pattern update of a spin model in which 
spins only interact with nearest neighbors of a square lattice can be more efficient for some computer architectures or 
programming environments. 
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2. Determine the change in the energy of the trial state compared to the previous 
state, namely A E — E(q u -ia\) - E(qj). If A E < 0, accept the trial state, that is, set 

qj + 1 = Atrial- If > 0, then accept the trial state with probability exp(-j6 A E). This is 
accomplished by using an additional call to the pseudorandom number generator. If 
rand() < exp (—ft AE), then accept the trial state. If the interactions are short-ranged, 
the calculation of the energy change will only involve interactions with a few nearby 
particles or spins. If the trial state is illegal in some way, that is, it is not an allowed 
state in the set of all configurations, then the state should be rejected. This is 
equivalent to setting the energy change at +oo. If the trial state is rejected for either 
reason, then set the new state of the system equal to the previous state qj + \ — qj, that 
is, leave the state at the old value qj, throw away the trial state, and move on. 

3. Perform steps 1 and 2 once for each particle or spin in the system. This is often done 
randomly to ensure detailed balance. 4 Steps 1 through 3 define one Monte Carlo 
sweep. 

4. Repeat steps 1 through 3 for M eq Monte Carlo sweeps to let the system equilibrate. The 
proper choice of M eq is not obvious a priori. At the very least, all the measures A(q) 
studied in the simulation should no longer have any obvious monotonic drift by 

the end of equilibration. This does not guarantee that the system has reached 
equilibrium since the system could well be trapped in the vicinity of a long-lived 
metastable state. 

5. Repeat steps 1 through 3 for M Monte Carlo sweeps while keeping track of all the 
thermodynamic variables one wants to measure, namely {Aiqj )}^. Use 
equations (16.1.1) and (16.1.2) to determine the equilibrium averages and 
uncertainties. 

To determine averages at a different set of parameters (temperature, density, etc.), change 
the parameters by a small amount and repeat steps 1 through 5, including the equilibration 
step 4. 5 Using the last configuration of the previous run as the first configuration of the next 
run can often reduce the equilibration time. Figure 16.1 shows a Monte Carlo calculation 
of the specific heat of the two-dimensional Ising model on a 128 x 128 square lattice, as 
compared to the exact solution presented in Section 13.4.A. 


4 Sequential and other update methods that violate detailed balance are sometimes used for efficiency but special 
care should be taken to ensure that detailed balance is maintained on average. 

5 Histogram reweighting methods can sometimes be used to reduce the number of temperatures and fields that need 
to be simulated (Ferrenberg and Swendsen, 1988). For example, if a spin simulation at coupling K and field h collects a 
histogram that samples the joint energy-magnetization distribution P^^(E,M), the distribution at nearby temperatures 
and fields is given by 


P KM (E,M)e AKE+&hM 

K+AK,h+Ah(, ' M >- J2 EM P K , h (E,M)e^B+AhM- 

Other methods that are now widely used are: parallel tempering, multicanonical Monte Carlo, and “broad histogram" 
methods. These are particularly effective for studying systems with strongly first-order phase transitions. Monte Carlo 
renormalization group methods are very powerful for studying critical points. Fora survey, see Landau and Binder (2009). 
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FIGURE 16.1 Monte Carlo specific heat (x's) of the two-dimensional Ising model on a 128 x 128 lattice, as compared 
to the exact solution (solid line) from Section 13.4.A; see Kaufman (1948), Ferdinand and Fisher (1967), and Beale 
(1996). The MC error bars are smaller than the symbols used, except near the bulk critical temperature T c (oo). Each 
data point represents an average using 10 s Monte Carlo sweeps, except at the bulk critical point where 10 6 Monte 
Carlo sweeps were used to mitigate critical slowing down. 


16.3 Molecular dynamics 

The purpose of a molecular dynamics simulation is to integrate Newton’s equations of 
motion for the set of particles in the given system. One advantage of MD over MC is that 
it approximates the time evolution of the equations of motion of the system, so MD can 
be used to study a host of dynamical properties. MD is usually more efficient at simulat¬ 
ing systems with long-range interactions since all the particles are updated together. MD 
is sometimes easier to implement than MC for complex systems since appropriate MC 
moves are sometimes difficult to derive. There are MD variants that allow simulations of 
other ensembles, but the simplest case simulates a microcanonical ensemble in which the 
microstate of the given system explores its energy surface in the phase space; see Allen and 
Tildesley (1990) and Frenkel and Smit (2002) for details. 

The equations of motion here are 


d 2 rj 
dt 2 



1 

— V/f7 (ri,r 2 ,..,rjv), 
m. 


( 1 ) 


where F, is the force on particle i arising from the iV-particle potential energy function 
U. The MD simulation moves the system forward in time by discrete steps At. The most 
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commonly used integration method in this context is due to Verlet (1967): 


r,-(f + At) = 2r,-(0 - n(t- At) + 


(At) 2 




(2) 


This is equivalent to the leap-frog and velocity Verlet algorithms that update both positions 
and velocities of the particles; see Frenkel and Smit (2002). The Verlet method preserves the 
time-reversal symmetry of the Hamiltonian equations of motion and has an error per step 
of order (At) 4 , while only requiring a single determination of the force on each particle, 
which is usually the most computationally time-consuming part of the simulation. Most 
importantly, the Verlet algorithm is symplectic, so the integration is equivalent to an exact 
solution of a “nearby” ghost Hamiltonian, which results in good long-term stability and 
good conservation of energy properties. 

A simulation starts the system in some initial microstate with defined positions and 
velocities of all the particles, and the integration algorithm steps the positions and veloc¬ 
ities of the particles forward in time. The simplest forms of the approximations employed 
for the velocities and the energy are 


Vi(t) = 


rj(t + At) - rj(t - At) 
2At 


M . p 

mi ( r t (t + At) - r t (t - AtC z 


_ x—' '>h 


i=l 


2At 


+ U[ri(t),r 2 (t),..,r N (t)]. 


(3a) 

(3b) 


The time-step At is chosen to be small as compared to the shortest fundamental time scale 
in the Hamiltonian, while not so small as to limit the efficiency of the program. The numer¬ 
ical integration approximates a member of the microcanonical ensemble moving along 
the constant-energy hypersurface in the phase space. Calculating equilibrium averages 
properly depends on the Hamiltonian being ergodic; 6 this allows the system to sample 
all regions of the constant-energy hypersurface, so the MD time-averages are equivalent 
to averages over the microcanonical ensemble. If the total energy drifts more than some 
predetermined amount during the course of the simulation, then all the velocities can 
be rescaled to shift the total energy back to its initial value. Alternatively, one can use a 
thermostat to maintain the temperature at a desired value; see Frenkel and Smit (2002). 

6 Since an MD simulation creates a time evolution of the model system, one needs some assurance that the sys¬ 
tem is ergodic, that is, the time averages and the ensemble averages are the same. For example, a system of harmonic 
oscillators is not ergodic. A system of IV particles in d dimensions has a 6N-dimensional phase space, so the constant- 
energy hypersurface has 6 N — 1 dimensions. The normal mode solution for N coupled oscillators has 3 N constants of 
the motion, so the system explores only a 31V-dimensional hypersurface. Even making the couplings between particles 
anharmonic does not eliminate the problem as first shown by Fermi, Pasta, and Ulam (1955) and explored theoretically 
by Kolmogorov (1954), Arnold (1963), and Moser (1962). MD simulations of equilibrium systems presume that the sys¬ 
tem is ergodic. There are only a few systems that are provably ergodic but, fortunately, most systems with realistic pair 
potentials appear to behave ergodically in two or more dimensions. In view of this, MD simulations of one-dimensional 
systems should be treated as suspect from this perspective. 
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A commonly used pair interaction for monatomic fluids such as neon and argon is the 
Lennard-Jones interaction 


u(r) = 4e 




( 4 ) 


where D is the molecular diameter and e is the depth of the attractive well. The Lennard- 
Jones potential is attractive at long distances, and decays as 1/r 6 to model the van der 
Waals attraction; at short distances, it diverges as 1/r 12 to model the Pauli repulsion 
that prevents overlap of the electronic wavefunctions. Simulations are best carried out 
using dimensionless parameters. In the case of a fluid with Lennard-Jones interactions, 
all lengths can be measured in units of D, all energies (including kT) in units of e, all forces 
in units of e/D, all pressures in units of e/D 3 , all times in units of v / mD 2 /s , and so on. Simu¬ 
lations then need to be conducted only for single values of reduced temperature kT /e, the 
reduced density nD 3 , and so on, while measuring observables in reduced units. Compar¬ 
isons between simulations and experimental results can then be made using experimental 
values of D, m, and e. In dimensionless units, the Lennard-Jones force between a pair of 
particles is 


F = 





( 5 ) 


Newton’s third law of motion can be used to reduce the number of force calculations by 
a factor of two. The Lennard-Jones model was first studied in an MC simulation by Wood 
and Parker (1957) and in an MD simulation by Rahman (1964) and Verlet (1967). 


16.3.A Molecular dynamics algorithm 

First, start the system by choosing an initial state by setting the initial positions and veloc¬ 
ities of all the particles. The initial velocities are usually set by choosing each component 
of the velocity vector of each particle from the Maxwell distribution. In reduced units, 
this is 


PMaxwell(Ct(0)) = — exp 

\J2tz L 



( 6 ) 


7 For example, the Lennard-Jones parameters appropriate for argon are e/fc = 119.8 K and D = 0.3405 nm (Levelt, 1960; 
Rowley, Nicholson, and Parsonage, 1975). Interaction potentials are almost always cut off at a finite distance between 
molecules to reduce the number of interactions that need to be considered at each time step. For the Lennard-Jones 
interaction, this is most commonly done at r ma x = 2.5 D. If the potential is set to zero for distances greater than r max , 
then this would leave a small discontinuity in the potential. To eliminate this, the potential is often shifted upward by 
-u(2.5D) ~ 0.0163s, so that the potential is zero at /‘max- This allows a direct comparison between MC and MD simula¬ 
tions. If the shift is not made in the potential, one could not directly compare the results from MC and MD simulations 
because the discontinuity in the potential would result in a delta-function force that affects the motion in the MD sim¬ 
ulation but not the configurations in the MC simulation. Comparisons of MC and MD results with experiments need to 
include perturbations from the shift and the missing tails of the pair potentials. 
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see Appendix I to see how to use a uniform pseudorandom number generator to select 
from a Gaussian distribution. The initial velocities can then be used to set the positions of 
the particles after the first time-step, namely 


r,-(A t) = r;(0) + i>,'(0)Af + 


1 (At) 2 

2 m t 


Fi( 0 ). 


(7) 


1. Next, use equation (2) to move the system forward in time through M eq = r eq /A£ time 
steps. The equilibration time r eq must be chosen large enough for the system to 
equilibrate; see the Monte Carlo discussion in Section 16.2. A thermostat is often used 
to evolve the system to a state with the desired temperature; see Frenkel and Smit 
( 2002 ). 

2. Now, use equation (2) to move the system forward in time through M = r aV g/A t time 
steps while keeping track of all the thermodynamic variables {A(qj)} one wants to 
measure. Finally, use equation (16.1.2) to determine the equilibrium averages and 
uncertainties. 

To determine averages at a different set of parameters (temperature, density, etc.), change 
the parameters by a small amount and repeat steps 1 and 2. Using the last configura¬ 
tion of the previous run as the first configuration of the new run can often reduce the 
equilibration time. 


16.4 Particle simulations 

Fluids can be modeled by both MC and MD simulations by placing N particles in a periodic 
box with volume V interacting via a pair potential. Hansen and McDonald (1986), Allen 
and Tildesley (1990), and Frenkel and Smit (2002) provide excellent surveys of this topic. 
Calculating energy changes of trial moves in MC or forces in MD only involves pairs of 
particles whose closest periodic copies are within the cutoff distance of each other. MC 
simulations typically sample the canonical ensemble, 8 so they control the temperature 
and density, and measure the energy, pressure, and so on. MD simulations typically sample 
the microcanonical ensemble, 9 so they control the energy and density, and they measure 
the temperature, pressure, and so on. The equipartition theorem gives for the temperature 


8 Monte Carlo simulations of isobaric or grand canonical ensembles are also widely used by adding PV or 11 N terms 
to the Hamiltonian (Frenkel and Smit, 2002). 

9 Molecular dynamics simulations of other ensembles are possible. For example, one can include extra dynamical 
variables that allow the total energy or volume to fluctuate in order to approximate a canonical or isobaric ensemble; 
see Frenkel and Smit (2002). Variants of MC and MD simulations that span the range between the two include hybrid 
Monte Carlo methods that mix MC and MD methods into one code to take advantage of the strengths of both methods. 
Alternatively, one can include Langevin random force terms and damping in an MD simulation to create coupling to a 
heat bath. 
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T in a d-dimensional system 



( 1 ) 


The virial equation of state in equations (3.7.15) and (10.7.11) can be used to determine 
the pressure P in either type of simulation: 

m - 1 + MSf (^ ■"<'«>■■'«) - 1 - 2Sf / Tr‘ rSWdr - <2) 

As discussed in Section 10.7, the pair correlation function of a fluid can be used to measure 
a variety of thermodynamic properties including the pressure, the isothermal compress¬ 
ibility Ary, and the scattering structure factor S(k)-, see equations (10.7.18) through (10.7.21). 
The pair correlation function g(r) defined in equation (10.7.5) can be determined by col¬ 
lecting a histogram of the distances between all pairs of particles periodically during the 
simulation, accounting for the periodic boundary conditions, and scaling the histogram 
by an amount proportional to the volume of shells of radius r and thickness A r. In three 
dimensions, the pair correlation function is given by 


g(r) = 


2V 

n2 {H r)[(r+Ar) 3 



(3) 


where the step function A A r(f) is unity for 0 < f < Ar and zero otherwise. This expression 
is the ratio of the number of events in each bin in the histogram compared to the average 
number that would be expected for an ideal gas with the same density. 


16.4.A Simulations of hard spheres 

The system of hard spheres has been studied extensively in both MC and MD simulations, 
and was the first model studied using either method (Metropolis et al., 1953; Adler and 
Wainwright, 1957, 1959). The pair potential for hard spheres is 


u(r) = 



for r > D, 
for r < D. 


(4) 


Temperature is an irrelevant parameter for the spatial configurations sampled by this 
model since the pair potential does not have a finite energy scale. A full exploration of 
the phase diagram involves only varying the reduced number density nD d . All thermody¬ 
namic properties are either independent of temperature, or scale with temperature in a 
trivial way. For example, the scaled pressure for a system of hard spheres P/nkT is a func¬ 
tion only of the reduced number density nD d . The hard sphere density is often expressed in 
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terms of the packing fraction rj, the fraction of the volume of the system actually occupied 
by the spheres. In three dimensions, the volume fraction is given by i] — jrnD 3 /6. Since the 
pair potential is singular, the pressure cannot be calculated using the virial equation (2) but 
the pressure can be determined using the virial equation of state for hard spheres, namely 
(10.7.12). 

An MC code for hard spheres is relatively simple since the energy change in a trial move 
is either zero or infinity. A trial displacement of a particle is rejected if the trial position of 
the particle is within a distance D of any other particle, and is accepted otherwise. This was 
the first statistical physics model ever studied in a computer simulation (Metropolis et al., 
1953). 

Implementing MD for hard spheres requires a different approach from the standard 
MD. Finite-difference integration methods will not work here since the potential is not 
differentiable. Instead, one can exploit the exact solution to the equations of motion. Each 
particle travels in a straight line at a constant velocity except at the instants when pairs 
of particles collide, that is when they are a distance D apart. Due to the singular nature of 
the potential, the collisions can be uniquely time-ordered. Each collision conserves both 
kinetic energy and momentum, so the velocities after each collision can be determined 
analytically from the velocities and the displacement vector between the centers of the two 
particles at the moment of the collision. The changes in the velocities of the two colliding 
particles are 


A Vi = — Av; = 


( r // • l 'ij) r ij 
D 2 


I ru\=D 


(5) 


where rp and Vjj are, respectively, the relative positions and relative velocities of the two 
particles. The simulation moves the particles forward in time from collision to collision 
and changes the velocities of the pairs of particles involved in the collisions, as given by 
equation (5). This was the first implementation of the MD method in statistical physics 
(Alder and Wainwright 1957, 1959). 

The pair correlation function and the structure factor in the fluid phase are shown in 
Figure 16.2 and the phase diagram for hard spheres in Figure 16.3. At low densities, the 
equilibrium phase is a short-range ordered fluid. At high densities, the equilibrium phase 
of the model is a long-range ordered, face-centered cubic solid. It is worthwhile to note 
that an attractive interaction is not required for a model to have a crystalline phase. An 
attractive interaction is, however, required for the formation of a liquid-vapor coexistence 
line and a critical point. For this reason, the low-density phase of the hard sphere model is 
often referred to as a fluid phase rather than a liquid phase since the model does not have 
a liquid-vapor coexistence line. The liquid and solid volume fractions at the liquid-solid 
coexistence line are rp ~ 0.491 ± 0.002 and rj s — 0.543 ± 0.002, respectively. The liquid-solid 
coexistence pressure is given by Pt = P) s D 3 /kT — 11.55± 0.11; see Speedy (1997). In the 
low-density fluid phase for ;; < rp, the reduced pressure is accurately modeled by the 
Carnahan-Starling equation of state (10.3.25) 


1 + ij + rj 2 — r) 3 
(1 - r,) 3 


P 

nkT 


( 6 ) 
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(a) (b) 


FIGURE 16.2 (a) The pair correlation function g(r) and (b) the static structure factor S(k) for a three-dimensional 
system of hard spheres at volume fraction j; = 0.49 from a Monte Carlo simulation of 2916 particles (M. Glaser, 
unpublished). This value of the volume fraction is in the liquid phase close to the solid-liquid coexistence line. The 
solid lines depict the pair correlation function and the static structure factor from the Percus-Yevick approximation; 
see Percus and Yevick (1958), Wertheim (1963), and Hansen and McDonald (1986). 



FIGURE 16.3 Sketch of the equilibrium phase diagram for hard spheres in three dimensions. The horizontal axis is 
the volume fraction = nnPp/% and the vertical axis is the scaled pressure P* —PDP/kT. There are two equilibrium 
phases: a low-density fluid phase for 0 < ij < rj/ and a high-density solid phase for ij s < rj < rj cp . 

although there are other good parametizations as well. In the solid phase the pressure is 
approximately given by 


P 3 tj*- 0.7072 

_—_0.5921-_ 

nkT 1 — ri* ' rj* — 0.601 


(7) 


where rj*(= r]/r] cp ) is the ratio of the actual packing fraction to the maximum close-packed 
value rj cp , namely 7 tV 2/6~ 0.7405 (Speedy, 1997; Frenkel and Smit, 2002). The pressure 
in the solid phase diverges as the density approaches the close-packed density. The model 
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also exhibits a metastable disordered phase for densities between rp and the random close- 
packed volume fraction rj rcp — 0.644 ± 0.005; see Rintoul and Torquato (1996). For a survey 
of hard sphere results, see Mulero et al. (2008). 

16.5 Computer simulation caveats 

Computer simulations are widely used in statistical physics and have played an important 
role in our understanding of many physical systems. Simulations complement theory and 
experiment and provide many advantages for the study of systems that are not amenable 
to exact or approximate theoretical analysis. However, it is important to understand the 
inherent limitation of this technique. 

• Computer simulations necessarily involve a limited number of degrees of freedom, 
typically hundreds to thousands of particles or spins. This is not nearly large enough 
to display many of the behaviors that occur in thermodynamically large systems. For 
example, a model of a dense system of 1,000 particles in a three-dimensional cubic 
box will only have about 10 particles along each linear dimension of the box, so 
correlations beyond about five particle diameters are affected by the periodic 
boundary conditions. Extraction of accurate thermodynamic behavior from such a 
study will often involve an analysis of the finite-size scaling behavior of the model for a 
sequence of systems with different sizes. 

• Computer simulations necessarily involve a limited time-scale of simulation. Typical 
molecular time scales are of the order of to ^ a/v, where a a microscopic length scale 
and v a molecular velocity. For atomic scales near room temperature, a ~ 0.1 nm and 
v ~ 100 m/s, which gives to ss 10 -12 s. In an MD simulation, each time step moves the 
system forward in time by an amount At, which must be much less than to in order 
to aptly integrate the equations of motion, say At ^ 10 -14 s. A simulation that moves 
the system forward through 10 6 time steps will sample a physical time of only 10 ns, 
which may not be sufficient to reach many important time-scales of interest in the 
problem. This is especially problematic when the system has inherently slow time- 
scales, such as the critical slowing down near second-order phase transitions and the 
hysteresis near first-order phase transitions. Monte Carlo simulations are similarly 
hampered in that the simulation must run long enough for the model to explore a 
sufficiently large region of the phase space to capture the equilibrium behavior. In 
favorable cases, this and the previous issue can be mitigated by special simulation 
methods such as coarse graining, cluster update methods, parallel tempering, 
multicanonical Monte Carlo methods, Monte Carlo renormalization group, and so 
on; see footnote 5. 

• Interactions between particles are usually highly simplified for computational 
efficiency and the interaction range is usually cut off. Long-range interactions 
(Couloub, dipole, van der Waals, etc.) need to be resummed or treated via perturbation 
theory to try to account for their effects. 
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• MC and MD simulations do not directly measure the number of microstates available 
to the system, so one cannot directly calculate the entropy or the free energy in the 
same way as other observables. If, for example, a determination of the free energy is 
necessary to locate a phase transition, then it can be determined by a thermodynamic 
integration to a state with a theoretically known free energy; see Frenkel and Smit 
( 2002 ). 

• MD simulations depend on the ergodicity of the Hamiltonian, so one-dimensional 
models, models that are weakly perturbed from mechanical equilibrium, and other 
nearly integrable models may get trapped in low-dimensional orbits that do not fully 
explore the constant-energy hypersurface of the Hamiltonian; see footnote 6. 

• All pseudorandom number generators produce some level of correlation in their 
sequences. Even subtle correlations in pseudorandom number sequences can 
produce erroneous results in Monte Carlo simulations. Different classes of generators 
have different weaknesses, so switching to a generator based on a different algorithm 
will sometimes cure a problem caused by correlations produced by a particular 
generator. Testing a generator before using it in a simulation is always a good idea; 
see Appendix I. 

• It is extremely important to confirm the validity of MC and MD simulation codes. 

This process is rather different from verifying a theoretical calculation. Some code 
evaluation and verification procedures include: testing the code initially on small 
systems with known properties, testing the code whenever possible against models 
with exact solutions or models that have been widely studied in the literature, 
examining results as a function of system size and run length, and retesting carefully 
whenever new interactions or code modules are added. In this connection, Frenkel 
and Smit (2002), Parker (2008), and Landau and Binder (2009) provide lists of good 
strategies. 


Problems 

16 . 1 . Write a code to test a uniform pseudorandom number generator. If you do not have a canned 
generator available, write a generator based on L’Ecuyer’s recommended generator in Appendix I. 
Apply the following tests: average ( x) = 1/2, variance (x 2 ) - (x) 2 = 1/12, and the pair correlations 
test (x 1+ fcX,) = 1/4 for k ^ 0. Generate a histogram of pairs of numbers on a two-dimensional unit 
square and test that the distribution is statistically uniform. 

16 . 2 . Write a code to test a Gaussian pseudorandom number generator. If you do not have a canned 
generator available, write a generator based on the Box-Muller algorithm in Appendix I. Apply 
the following tests: average (x) = 0, variance (x 2 ) = 1, and the pair correlations test (x i+ r-Xi) = 0 for 
k ^ 0. Generate a histogram of pairs of numbers in two dimensions and test that the distribution 
is statistically Gaussian. 

16 . 3 . Define a sequence of correlated random numbers 

s k = as k _ l + (1 -a)r k , 

where r k is a unit-variance, uncorrelated, Gaussian pseudorandom number while 0 < a < 1 
defines the range of the correlations. Show that this sequence is Gaussian distributed, with a zero 
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mean. Determine the variance in terms of a and compare your result with equation (16.1.2b). 
Write a code to determine the correlation function (16.1.3). Plot your measured correlation 
function and compare it to the exact correlation function. 

16 . 4 . Write a Monte Carlo code for a system of N hard spheres of diameter D on a one-dimensional 
ring of length L with periodic boundary conditions. Calculate the pair correlation function 
and compare it to equations (13.1.6) and (13.1.7). The pressure of the system is given by 
P/nkT = 1 + nDg(D + y, see equation (10.7.12). Compare your pressure to the one obtained for the 
exact configurational partition function Zjv = L(L — ND) n ~ 1 /N\\ see equation (13.1.2). 

16 . 5 . Write a Monte Carlo code for a fluid of N hard spheres in a two-dimensional Lx L square box with 
periodic boundary conditions in each direction. Calculate the pair correlation function and 
determine the scaled pressure using equation (10.7.12), namely P/nkT = 1 + 2?/g(D 1 ). Compare 
this pressure to the approximate form P/nkT = (1 + ?j/8)/(l — ij) 2 . 

16 . 6 . Write a Monte Carlo code for a fluid of N hard spheres in a two-dimensional Lx L square box and 

include a one-body gravity term m gyi in the algorithm, that is, accept otherwise legal 

configurations with probability exp(— fimgAy). You will need to use hard-wall boundary 
conditions on the top and bottom walls. Determine the average number density as a function of 
the vertical position in the box. 

16 . 7 . Write a molecular dynamics code for N Lennard-Jones particles in a two-dimensional Lx L 
square box. Apply periodic boundary conditions in each direction. Determine the scaled pressure 
using the virial equation (16.4.2). Calculate and plot the pair correlation function of the system. 

16 . 8 . Write a molecular dynamics code for N Lennard-Jones particles in a two-dimensional Lx L 
square box, and include a one-body gravity term in the energy: Yl’iLi m gyi- Apply periodic 
boundary conditions in the x-direction but a repulsive WCA (Weeks, Chandler, and Andersen, 
1971) potential on the top and bottom walls. The WCA potential is the repulsive part of a 
Lennard-Jones potential for r/D < (2) 1 / 6 , with the potential shifted up by e. Show that the average 
kinetic energy per particle is independent of the height y in the box but the average scaled density 
nD 2 depends on the vertical position in the box. 

16 . 9 . Write an MC code to simulate the one-dimensional Ising model on a periodic lattice of length L. 
Calculate the internal energy and specific heat of the model and compare them to 
equations (13.2.15) and (13.2.16). Calculate the correlation function G(n) = () and it 
compare to equation (13.2.32). 

16 . 10 . Write an MC code to simulate the two-dimensional nearest-neighbor Ising model on a periodic 
Lx L lattice in zero field. Calculate the internal energy and the specific heat of the system as 
functions of temperature and compare them to the exact results in section 13.4.A. See exact 
results for the two-dimensional Ising model for various lattice sizes at www.elsevierdirect.com. 

16 . 11 . Write an MC code to simulate the two-dimensional nearest-neighbor Ising model on a periodic 
Lx L lattice in zero field. Calculate the energy distribution P(E) over a range of temperatures 
including the critical point. Use this distribution to calculate the internal energy and the specific 
heat as functions of temperature. See exact results for the two-dimensional Ising model for 
various lattice sizes at www.elsevierdirect.com. 

16 . 12 . Write an MC code to simulate the one-dimensional AY model. Calculate the internal energy, the 
specific heat, the isothermal susceptibility, and the pair correlation function, and compare your 
results to the analytical results for the n = 2 case in Section 13.2. 

16 . 13 . Write an MC code to simulate the two-dimensional XY model. Calculate the internal energy, 
specific heat, isothermal susceptibility and the pair correlation function, and compare your 
results to the theoretical results given in Section 13.7. 


Appendices 


A Influence of boundary conditions on 
the distribution of quantum states 

In this appendix we examine, under different boundary conditions, the asymptotic distribution of 
single-particle states in a bounded continuum. For simplicity, we consider a cuboidal enclosure of 
sides a, b, and c. The admissible solutions of the free-particle Schrodinger equation 

V 2 l/r + fc 2 f = 0 ( k = ph~ i y (1) 

which satisfy Dirichlet boundary conditions (namely, ^ = 0 everywhere at the boundary), are then 
given by 


thnnir )« sin 


(£)*( 


mny\ . /nnz 

-r) sm 


c> 


( 2 ) 


with 


k = n 



l,m,n= 1,2,3,.... 


(3) 


Note that in this case none of the quantum numbers l, m, or n can be zero, for that would make the 
wavefunction identically vanish. If, on the other hand, we impose Neumann boundary conditions 
(namely, dij//dn = 0 everywhere at the boundary), the desired solutions turn out to be 


a cos 


/ lnx\ 


\ a J 


|cos (^) c ° s (_), 


mrz\ 
c )’ 


(4) 


with 


l,m,n = 0,1,2,...; (5) 

clearly, the value zero of the quantum numbers is now allowed! In each case, however, the negative- 
integral values of the quantum numbers do not lead to any new wavefunctions. 

The total number g(K) of distinct wavefunctions i//, with wave number k not exceeding a given 
value K, may be written as 


g(K)= J2f(l,m,n), 

l,m,n 


( 6 ) 
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where/(Z, m, n) — 1 for the numbers (l,m,ri) belonging to the set (3) or (5), as the case maybe; the 
summation in each case is restricted by the condition 


Z 2 


+ 




< 


K 2 


(7) 


We now define a sum 


G(K)=J2f*(l,m,n), (8) 

l,m,n 

where f*(l, m, n) = 1 for all integral values of Z, m, and n (positive, negative, or zero), the restric¬ 
tion on the numbers ( l,m,n ) being the same as stated in (7). One can then show, by setting up 
correspondence of terms, that 



the upper and the lower signs here correspond, respectively, to the Dirichlet and the Neumann 
boundary conditions. 

Clearly, the first sum on the right side of (9) denotes the number of lattice points in the ellipsoid 1 
(X 2 /a 2 + Y 2 /bi 2 +Z 2 /c 2 ) = K 2 /tc 2 , the next three sums denote the numbers of lattice points in the 
ellipses, which are cross-sections of this ellipsoid with the Z-, Y- and X-planes, while the last three 
sums denote the numbers of lattice points on the principal axes of the ellipsoid. Now, if a , b, and c 
are sufficiently large in comparison with n/K, one may replace these numbers by the corresponding 
volume, areas, and lengths, respectively, with the result 

]£3 2 ^ 

g(K ) = — ~(abc) =f — (ab + ca + bc) + — (a+b + c)^ - +E(K); (10) 

8k 4 k 8 

the term E(K) here denotes the net error committed in making the aforementioned replacements. 
We thus find that the main term of our result is directly proportional to the volume of the enclosure 
while the first correction term is proportional to its surface area (and, hence, represents a “surface 
effect”); the next-order term(s) appear in the nature of an “edge effect” and a “corner effect.” 

Now, a reference to the literature dealing with the determination of the "number of lattice points 
in a given domain” reveals that the error term E(K) in equation (10) is 0(K a ), where 1 < a < 1.4; 
hence, expression (10) for g(K) is reliable only up to the surface term. In view of this, we may write 

K 3 K 2 

g(K) = — w V -S + a lower-order remainder; (11) 

67r z 16 jt 

'By the term ‘‘in the ellipsoid" we mean “not external to the ellipsoid,” that is, the lattice points “on the ellipsoid” are 
also included. Other such expressions in the sequel carry a similar meaning. 
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in terms of e*, where 


8 mL 2 
h 2 


AT2 T^ 

e=—P 2 =—K 2 

h 2 IT 2 ’ 


equation (11) reduces to equations (1.4.15) and (1.4.16) of the text. 
In the case of periodic boundary conditions, namely 


i l/(x,y,z) = i ls(x + a,y,z) = \j/(x,y+b t z) = i /f(x,y,z + c), 


the appropriate wavefunctions are 


flmn(r) cxexp{i(k-r)}, 


( 12 ) 


(13) 


(14) 


with 

, (l m n\ 

lc = 2n l,m,n = 0,±1,±2,_ 

\a b cJ 

The number of free-particle states g(K) is now given by 

g(K) = J2 f* (l, m,n), 
l,m,n 


(15) 


(16) 


such that 


(l 2 /a 2 + m 2 /b 2 + n 2 /c 2 ) < K 2 /(4n 2 ). (17) 

This is precisely the number of lattice points in the ellipsoid with semiaxes Ka/2n, Kb/2n, and 
Kc/2n, which, allowing for the approximation made in the earlier cases, is just equal to the volume 
term in (11). Thus, in the case of periodic boundary conditions, we do not obtain a surface term in 
the expression for the density of states. 

For further information on this topic, see Fedosov (1963,1964), Pathria (1966), Chaba andPathria 
(1973), and Baltes and Hilf (1976). 


B Certain mathematical functions 

In this appendix we outline the main properties of certain mathematical functions that are of special 
importance to the subject matter of this text. 

We start with the gamma function F(v), which is identical with the factorial function (v — 1)! and 
is defined by the integral 

OO 

T(v) = (v - 1)! = J e~ x x v ~ l dx ; v > 0. (1) 

o 


First of all, we note that 


r (i) = o!= l. 


( 2 ) 
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Next, integrating by parts, we obtain the recurrence formula 

T(v) = -r(v+ 1), (3) 

v 

from which it follows that 

r(v + l) = v(v-l)---(l + p)pr(p), 0 < p < 1, (4) 

p being the fractional part of v. For integral values of v (v = n, say), we have the familiar 
representation 


T{n+ 1) = n\= n(n— 1) - - -2 -1; 
on the other hand, if v is a half-odd integral (v = m + j, say), then 


(5) 


'( m +l)^( m -l) != ( m - l)( m -|)...3,(l) r (l) = (2m X)(2 ™ m 3) "~ 3 ' 1 ^ 1 /2, (6) 


where use has been made of equation (21), whereby 

r(|),(-£)!_„'*. 


(7) 


By repeated application of the recurrence formula (3), the definition of the function T(v) can be 
extended to all v, except for v = 0, — 1, —2,... where the singularities of the function lie. The behavior 
of T(v) in the neighborhood of a singularity can be determined by setting v = —n + s, where n = 
0,1,2,... and |s| <$c 1, and using formula (3) n+ 1 times; we get 


r(-n + e) = 


1 


(-« + £)(-«+ 1+ £)■■■(-l + e)e 
(- 1 )" 


r(i + e) 


n\e 


Replacing x by ay 2 , equation (1) takes the form 


r(v) = 2a" j e~ ay2 y 2 "- 1 dy, v > 0. 


( 8 ) 


(9) 


We thus obtain another closely related integral, namely 


hv-\ = I e “ y2 y 2u l dy=^T{v), v > 0; 




( 10 ) 


by a change of notation, this can be written as 

OO 

= J 1 


I v = / e~ ay y"dy = 


2«( v+1 V 2 


v + 1 


v > -1. 


(ID 
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One can easily see that the foregoing integral satisfies the relationship 


h+2 = ~7—Iv- 
d a 


( 12 ) 


The integrals J„ appear so frequently in our study that we write down the values of some of them 
explicitly: 

1 /jr\l/2 , 1 /jt\1/2 , 3 / jt \ 1/2 


while 




7l 2a' 13 2a 2 ’ /s a 3 ’"'' 


(13a) 


(13b) 


In connection with these integrals, it may as well be noted that 

OO 

J e~ ay2 y v dy = 0 if v is an odd integer 


= 2I V if v is an even integer. 


(14) 


Next, we consider the product of two gamma functions, say T(/r) and T(v). Using representa¬ 
tion (9), with a = 1, we have 


CXJ CXJ 

rMrM - * f I m > 0 ,« > 0 . 


(15) 


0 0 


Changing over to the polar coordinates (r,0), equation (15) becomes 

tt/2 7t/2 


r(/x)T(v) = 4 J e r2 r 2 ^ +v ^ l dr J cos 2// 1 dsin 2v 1 9d0 = 2T(/r + v) J cos 2m 1 0sin 21 ' l 8dd. (16) 


0 0 

Now, defining the beta function Bin, v ) by the integral 

71 /2 


B(n, v) = 2 J cos 2/x 1 esin 2 " 1 9d9, /r > 0, u > 0, 
o 


(17) 


we obtain an important relationship: 

D/ , r (M)T(v) D/ x 
Bin, v) = ——■—- =Biv,n). 
r(n + v) 

Substituting cos 2 9 = //, equation (17) takes the standard form 


(18) 




Bin,v)= I 1 (l —f 7 ) v 1 d?j, /r > 0, v > 0, 


( 19 ) 
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while the special case n = v = \ gives 



Coupled with equations (2) and (18), equation (20) yields 

r(iH 1/2 - 


( 20 ) 


( 21 ) 


Stirling's formula for v! 

We now derive an asymptotic expression for the factorial function 

CO 

” 1221 
o 

for v » 1. It is not difficult to see that, for v»l, the major contribution to this integral comes from 
the region of x that lies around the point x = v and has a width of order Jv. In view of this, we invoke 
the substitution 

x=v + (^/v)u, (23) 

whereby equation (22) takes the form 

OO 

^d;)7 O + 1241 

The integrand in (24) attains its maximum value, unity, at u = 0 and on both sides of the maximum 
it falls rapidly to zero, which suggests that it may be approximated by a Gaussian. We, therefore, 
expand the logarithm of the integrand around u— 0 and then reconstruct the integrand by taking 
the exponential of the resulting expression; this gives 

. / v \ v C [ u 2 u 3 u A ] , 
v\ = *Jv (-) J exp - y + —U, (25) 

~V v 

If v is sufficiently large, the integrand in (25) maybe replaced by the single factor exp(— m 2 /2); more¬ 
over, since the major contribution to this integral comes only from that range of u for which \u\ is 
of order unity, the lower limit of integration may be replaced by — oo. We thus obtain the Stirling 
formula 


v\^y/{2nv){v/e) v , v » 1. 
A more detailed analysis leads to the Stirling series 


v\ = J(2 :™)(^) 


139 


571 


1 1 

+ \2v + 288v 2 ~ 51840v 3 _ 2488320v 4 


(26) 


(27) 
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Next, we consider the function In (u!). Corresponding to formula (27), we have, for large v, 


1 


1 


In(u!) = v -)— In v — v H—ln(27r) + 


1 


1 

12v 360v 3 ' 1260v 5 1680v 7 


1 1 

+ 


For most practical purposes, we may write 

ln(v!) ss (ulnv — v). 


(28) 


(29) 


We note that formula (29) can be obtained very simply by an application of the Euler-Maclaurin 
formula. Since v is large, we may consider its integral values only; then, by definition, 

n 

ln(«!) = y^(lni). 

i=i 

Replacing summation by integration, we obtain 


f 


In (n\) ~ / (In x)dx = (xlnx — x) 


X=l 


■ (nlnn — n), 


which is identical to (29) . 

We must, however, be warned that, whereas approximation (29) is fine as it is, it would be wrong 
to take its exponential and write v\ ss (v/e) v , for that would affect the evaluation of v! by a factor 
Ofv 1 / 2 ), which can be considerably large; see (26). In the expression for ln(v!), the corresponding 
term is indeed negligible. 

The Dirac 8-function 

We start with the Gaussian distribution function 

p(x,x 0t a) = 1 e- (x ~ x ° ) 2/2or2 , (30) 

V ( 27 t)g 

which satisfies the normalization condition 

oo 

J p{x,XQ,a)dx=\. (31) 

— OO 

The function p{x) is symmetric about the value Xq where it has a maximum; the height of this max¬ 
imum is inversely proportional to the parameter a while its width is directly proportional to <r, 
the total area under the curve being a constant. As a becomes smaller and smaller, the function 
p(x) becomes narrower and narrower in width and grows higher and higher at the central point xq, 
condition (31) being satisfied at all a; see Figure B.l. 

In the limit a —>• 0, we obtain a function whose value at x = xg is infinitely large while at x^xg 
it is vanishingly small, the area under the curve being still equal to unity. This limiting form of the 
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P(x-x o) 



FIGURE B.1 Gaussian distribution function (30) for different values of a. 


function p(x,xo,a ) is, in fact, the 5-function of Dirac. Thus, we may define this function as the one 
satisfying the following properties: 

(i) S(x — xo) = 0 forallx^xo, (32) 

OO 

(ii) J S(x — xo)dx=l. (33) 

— OO 


Conditions (32) and (33) inherently imply that, at x — xq , S (x — xo) = oo and that the range of integra¬ 
tion in (33) need not extend all the way from —oo to +oo; in fact, any range that includes the point 
x = Xq would suffice. Thus, we may rewrite (33) as 


Js(x — xo)dx=l HA<xq<B. 

A 


(34) 


It follows that, for any well-behaved function/(x), 


jD 

J f(x)S(x — xo)dx=f(xo) ifA<xo 


< B. 


(35) 


Another limiting process frequently employed to represent the S -function is the following: 


<5(x — xq) = Lim 


Y~*0 JT{(x — x 0 ) 2 + y 2 }' 


(36) 
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To see the appropriateness of this representation, we note that, for x^xq, this function vanishes like 
y while, for x — xq, it diverges like y _1 ; moreover, for all y, 

OO 


/ 


_ y _ dx -\ 

n{(x-x 0 ) 2 + y 2 } n 


tan 


-1 ( X-Xp ) ' 

Y 


= 1. 


(37) 


— OO 


An integral representation of the 5-function is 

OO 

5(jc-x 0 )= — [ e ik(x ~ Xo) dlc, 
2 n J 

— OO 


(38) 


which means that the 5-function is the “Fourier transform of a constant.” We note that, for x — x (l , 
the integrand in (38) is unity throughout, so the function diverges. On the other hand, for x^Xq, 
the oscillatory character of the integrand is such that it makes the integral vanish. And, finally, the 
integration of this function, over a range of x that includes the point x = Xq, gives 


OO 

x 0 +L 

— [ 

[ e mx ~ Xo) dx 

2n J 

J 

— OO 

_ x 0 -L 



gikL _ q— ikL 
2 j z(ik) 


dk= J 


sin (kL) 

--— dk= 1 

nk 


(39) 


independently of the choice of L. 

It is instructive to see how the integral representation of the 5-function is related to its previous 
representations. For this, we introduce into the integrand of (38) a convergence factor exp(— yk 2 ), 
where y is a small, positive number. The resulting function, in the limit y -> 0, should reproduce the 
5-function; we thus expect that 

OO 

8(x-x 0 )=— Lim [ e ik(x - Xo) - yk2 dk. (40) 

2n y—>0 J 

—OO 


The integral in (40) is easy to evaluate if we recall that 


while 


J cos (kx)e yk2 dk = 


.2fcoMh,^d t . 



OO 

— OO 


(41) 


(42) 


Accordingly, equation (40) becomes 

5( x-xo) = Lim —2 - e -(«o) 2 /4r 

^oV(tey) 


( 43 ) 











662 Appendices 


which is precisely the representation we started with. 2 

Finally, the notation of the <5-function can be readily extended to spaces with more than one 
dimension. For instance, in n dimensions, 

«(r) = «(*i) ■■■«(*„), (44) 


so that 


(i) 

8(r) = 0 for all r ^ 0, 

CO OO 

(45) 

(ii) 

J ' J 8(r)dx\ ■ ■ ■ dx n = 1. 

— OO —OO 

(46) 


The integral representation of 8(f) is 


S(r) = 


1 

(2n) n 


J ■■■ J e mr) d n k. 


—oo —oo 


(47) 


Once again, we may write 


8(r) = 


1 


- Lim 

(2n) n y-0 


OO OO 

/"/•* 


iOc-P-Yt? d n k 


—oo —OO 

n/2 


= Lim 


(J- Ye-^y. 


y—>0 \4ny ) 


(48) 


(49) 


C "Volume" and "surface area" of an 
n-dimensional sphere of radius R 

Consider an n-dimensional space in which the position of a point is denoted by the vector r, with 
Cartesian components (X\,.. .,x n ). The “volume element” dV n in this space would be 

n 

d n r = Y[(d x i); 

i= 1 

accordingly, the “volume” V n of a sphere of radius R would be given by 

Vn(R)= J-J f\(dXi). ( 1 ) 

n l= l 

0< J2 xf<R 2 
i= 1 

2 The reader may check that the introduction into (38) of a convergence factor exp(— y\k\), rather than exp(— yk 2 ), 
leads to the representation (36). 
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Obviously, V n will be proportional to R", so let us write it as 

V n (R) = C n R n , (2) 

where C n is a constant that depends only on the dimensionality of the space. Clearly, the “volume 
element” dV n can also be written as 


dV n = S n (R)dR = nC n R n l dR , 


(3) 


where S n (R ) denotes the “surface area” of the sphere. 
To evaluate C n , we make use of the formula 


J exp(—x 2 )dx = 7t 1 i 2 . 

— OO 

Multiplying n such integrals, one for each x z -, we obtain 

x i—°° ( n \ n 00 

n n/2 = J-J ex P {-J 2 x n Y\ (dx i) = J exp(-R 2 )nC n R n - 1 dR=nC n -= (|)!C«; 


(4) 


; (5) 


i'=i / (=1 

- v < « 

here, use has been made of formula (B.ll), with a = 1. Thus, 

2 Yl\ 




so that 


Vn(.R) — 


n n ' 2 

(n/2)\ 


R" and S n (R) = 


2n n ' 2 

T(n/2Y 


-R 


n— 1 


( 6 ) 


(7a,b) 


which are the desired results. 

Alternatively, one may prefer to use spherical polar coordinates right from the beginning — as, 
for instance, in the evaluation of the Fourier transform 

J(k) = J f(r)e ikr d n r. (8) 


In that case, 


d n r=r n ^sin^i)" 2 • • • {sm6 n _2) l drd0\ ■ ■ -ddn^dtp, ( 9 ) 

where the (9/ range from 0 to jt while <p ranges from 0 to 2 jt. Choosing our polar axis to be in the 
direction of k, equation (8) takes the form 


m = 


r n 1 (sin0i) ,i 2 ■■■(s\nO n _2) l drd9i---d6 n _2d(j). 


( 10 ) 
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Integration over the angular coordinates 9 \, 9 2 ,%,..., 2 and <j> yields factors 

where/ y (jc) is the ordinary Bessel function while B(p.,v) is the beta function; see equations (B.17) and 
(B.18). Equation (10) now becomes 


r / I \ (ll—2)/2 

f(k) = (2 n) n/2 J f{r) l—j J( n - 2 )/ 2 (kr)r n ~ l dr, 


( 11 ) 


which is our main result. 

In the limit A; —► 0, J v {kr) —* (j>krj /r(v + 1), so that 

OO 

o — W/2 r 

7(0) =- / f(r)r n ~ l dr, 

r( m/2 )J j 


( 12 ) 


consistent with (3) and (7b). On the other hand, if we take/(r) to be a constant, say 1 /(2n) n , we 
should obtain another representation of the Dirac S -function in n dimensions; see equations ( 8 ) and 
(B.47). We thus have, from (11), 


1 7 / 1 \ (n_2)/2 

S(k)= (2^J [Vr) J(n- 2 ) /2 (kr)r n ~ 1 dr. 


(13) 


As a check, we introduce a factor exp(— ar 2 ) in the integrand of (13) and obtain 

1 X (11-2)12 


1 ? / 1 / 1 \ n /2 ? 

S(fc) = Lim- H e~ ar ( 7 —) Ji n - 2 )/ 2 (kr)r n ~ l dr = Lim (-) e~ k /4 “ 

a^o(2n) n / 2 J \kr) 1 v a-*o\4na J 


(14) 


in complete agreement with (B.49). If, on the other hand, we use the factor exp (—ar) rather than 
exp (—ar 2 ), we get 


, (n +1 

S(k ) = LimT - 1 - -r. -=— ; —— 7 — 777 , 

\ 2 / {jt(k 2 + cy2))(H+lV2 


(15) 


which generalizes (B.36). 


D On Bose-Einstein functions 

In the theory of Bose-Einstein systems we come across integrals of the type 


/ x v dx 

—7 - (0 < z < 1, v > 0;z — l,u > 1 ). 

z~ 1 e x - 1 
0 


(1) 
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In this appendix we study the behavior of G v (z) over the stated range 3 of the parameter z. First of all, 
we note that 


OO 


Lim G v (z) — 

z->- 0 


J ze X x v l dx = zT(v). 
o 


( 2 ) 


Hence, it appears useful to introduce another function, g v (z), such that 

x v ~ 1 dx 


&(z)s r4 G ' (z)= i4 


I' x v L dx 

J z~ 1 e x - 1' 


o 

For small z, the integrand in (3) may be expanded in powers of z, with the result 

, °? oo oo / 2 3 

g v (z) =- / x v ~ l y (ze~ x ) l dx =y ^- = z+ — + — H— ; 

6 r (v) J *-> l v 2 V 3 V 

Q /= 1 /= 1 


(3) 


(4) 


thus, for z<l, the function g v (z), for all v, behaves like z itself. Moreover, g,,(z) is a monotonically 
increasing function of z whose largest value in the physical range of interest obtains when z —^ 1; 
then, for v > 1 ,g v (z) approaches the Riemann zeta function f (v): 


ft( 1 ) = Ejii = iW (v > i). 

i=i 1 


(5) 


The numerical values of some of the f (v) are 

jr 2 7t 4 7r 6 

f (2) = — ~ 1.64493, 4T(4) = — ~ 1.08232, t (6) =-~ 1.01734, 

s 6 s 90 945 

f (§) ~ 2.61238, ~ 1.34149, f (j) ~ 1.12673, 

and, finally, 


£ (3) ~ 1.20206, f (5) ~ 1.03693, £ (7) ~ 1.00835. 


For v < 1, the function g, (z) diverges as z —► 1. The case v = 1 is rather simple, for the function 
g„(z) now assumes a closed form: 

OO 

/ i dx , _ v I oo 

—=——-=ln(l -ze x )\ = — ln(l — z). (6) 

z~ l e x — 1 lo 

o 

As z ->■ 1, gi (z) diverges logarithmically. Setting z = e~ a , we have 

gi(e - “) = — ln(l - e _ “)->-ln(l/a). (7) 

oc—> 0 

3 The behavior of G v (z) for z > 1 has been discussed by Clunie (1954). 
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For v < 1, the behavior of g v (e “), as a ->• 0, can be determined as follows: 


OO OO 

1 r x v ~ 1 dx ^ 1 r x v ~ l dx 

r(u) J e oi+x — i ~ r(u) J a + x ' 
0 0 


Setting x = a tan 2 9 and making use of equation (B.17), we obtain 

„ r(i-v) 

gv(e-“)«—, „ (0 < v < 1). 

a l-v 


( 8 ) 


Expression (8) isolates the singularity of the function g v (e “) at a = 0; the remainder of the function 
can be expanded in powers of a, with the result (see Robinson, 1951) 


gv(e ") 


r(l-v) 


00 <— lV 

+ESr f(v_/) “ ! ’ 


( 9 ) 


£(s) being the Riemann zeta function analytically continued to all s ^ 1. 

A simple differentiation ofg y (z) brings out the recurrence relation 

4 fe(z)]s ^®' (z)=& - i{z) - (10) 

This relation follows readily from the series expansion (4) but can also be derived from the defining 
integral (3). We thus have 




LX- 

/ 


e x x v 1 dx 
(e x — z) 2 


Integrating by parts, we get 


3 r Z 

Z— Lgp(2)J = 


3 z 


r(v) 


v-V— 1 


X 1 ’ 1 |00 /' x v 2 dx 

- + (v- 1) / —- 

e x — z lo J e x — z 

o 


The integrated part vanishes at both limits (provided that v > 1), while the part yet to be integrated 
yields precisely g„_i (z). The validity of the recurrence relation (10) is thus established for all v > 1. 

Adopting (10) as a part of the definition of the function g v (z), the notion of this function may be 
extended to all v, including v < 0. Proceeding in this manner, Robinson showed that equation (9) 
applied to all v < 1 and to all nonintegral v > 1. For v = m, a positive integer, we have instead 


gm(e “) 


(—l)m— 1 
(m— 1)! 



°° (_ 1 

a m ~ 1 + ^2 - t;(m — i)a l . 

i =o 

i^m—l 


an 


Equations (9) and (11) together provide a complete description of the function g v (e “) for small a; it 
may be checked that both these expressions conform to the recurrence relation 

^g,(e-“) = -g y _i(e-“). 


( 12 ) 
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For the special cases v = §, |, and J, we obtain from (9) 

g 5/2 (a) = 2.36a 3/2 + 1.34 - 2.61a - 0.730a 2 + 0.0347a 3 + ■ ■ (13a) 

g 3/2 (a) = -3.54a 1/2 +2.61 + 1.46a - 0.104a 2 + 0.00425a 3 + ■ • •, (13b) 

g 1/2 (a) = 1.77a“ 1/2 - 1.46 + 0.208a - 0.0128a 2 + • • ■ . (13c) 


The terms quoted here are sufficient to yield a better than 1 percent accuracy for all a < 1. The 
numerical values of these functions have been tabulated by London (1954) over the range 0 < a < 2. 
The values of several important integrals involving relativistic bosons in Chapters 7 and 9 are: 


CXJ 

J* 1"<1—2«4> = - 


71 

45 ’ 


00 9 

f x 2 

J e x -1 


dx = 2f(3)~ 2.40411, 


OO 

f X 3 

J e x -l 


dx = 6f (4) = 


15 


(14a) 


(14b) 


(14c) 


E On Fermi-Dirac functions 

In the theory of Fermi-Dirac systems we come across integrals of the type 


r x v 1 dx 

F v (z) = / —TT7T (0 < 2 < oo, v > 0). 
J z~ 1 e x +1 
o 


( 1 ) 


In this appendix we study the behavior of F v ( z ) over the entire range of the parameter z. For the same 
reason as in the case of Bose-Einstein integrals, we introduce here another function, f v {z), such that 


_ 1 1 C x v 1 dx 

Mz) = —-F v (z) = —— / ■ 

T(v) T(v) J z 1 e x + 1 


( 2 ) 


For small z, the integrand in (2) may be expanded in powers of z, with the result 

, RP 00 00 / 2 „3 

f,(z) = - / x v ~ 1 y (-l) I ~ 1 (ze~ x ) l dx= y (-1 ) l ~ lZ r- = z- — + — 

F(v) J z—i l v 2 V 3 U 


(3) 


thus, for z <3C 1, the function/,, (z), for all v, behaves like z itself. 

The functions f v (z) are related to the Bose-Einstein functions g,,(z) as follows: 


f v (z) = g v (z ) - 2 1 l ’g„(z 2 ) (0 < z < 1, v > 0; z = 1, v > 1). 


(4) 
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This is useful for determining the values of relativistic Fermi-Dirac integrals needed in Chapter 9: 

/x 2 ln(l + e~ x )dx= (5a) 

J 360 

o 

f —— dx = :3 ^ ) ~ 1.80309, (5b) 

J e x +l 2 

o 

f x 3 , 7n 4 

/ - dx= - . (5c) 

J e x + l 120 

o 

The functions f,(z) and/„_i (z) are connected through the recurrence relation 

z iz lMz)] = n g) /,(z)=/, - l(z); (6) 

this relation follows readily from the series expansion (3) but can also be derived from the defining 
integral (2). 

To study the behavior of Fermi-Dirac integrals for large z, we introduce the variable 


£ = In z, 


(7) 


so that 


°o 

t . t f x v dx 

F v (e^) = r(v)f v (e^) = I ( 8 ) 

o 

For large the situation in (8) is primarily controlled by the factor (e x ~^ + 1) _1 , whose departure 
from its limiting values — namely, zero (as x ->• oo) and almost unity (as x ->• 0) — is significant only 
in the neighborhood of the point x — tp, see Figure E.l. The width of this “region of significance” is 
0(1) and hence much smaller than the total, effective range of integration, which is O(f). Therefore, 
in the lowest approximation, we may replace the actual curve of Figure E.l by a step function, as 
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shown by the dotted line. Equation ( 8 ) then reduces to 

f 


f+(e f )«* J x "- 1 


dx=*- 

V 


(9) 


and, accordingly, 




S v 

T(v + 1 ) ’ 


For a better approximation, we rewrite ( 8 ) as 

f 


Fv(.e t ) = f x 1 '- 1 


1 - 


1 


e$~ x +1 


dx + 


OO 

/>! 1 

J e*A +1 


dx 


o f 

and substitute in the respective integrals 

x = %-t] i and x = % + ri 2 , 

with the result 

f , oo 

? u f^-mY 1 drj i , fiM + nzY l dr ) 2 


v J en + 1 J 


ei2 +1 


( 10 ) 


( 11 ) 


( 12 ) 


(13) 


Since f 1 while our integrands are significant only for rj of order unity, the upper limit in the first 
integral may be replaced by oo. Moreover, one may use the same variable rj in both the integrals, with 
the result 


F V ^^1 + J 




\V-1 


-V + 2 E 


;'=1,3,5,.. 


e'i +1 


v- 1 


-dtj 


oo 

gv—l—j T —^— dr] 

J e’i+ 1 


(14) 


(15) 


in the last step the numerator in the integrand of (14) has been expanded in powers of Now, 


OO 

1 _ f rjl 

r(j'+ 1 ) J ei + l 


1 1 


d>} = 1 ~ ^TTT 4“ TTTXl ••• = ( 1 — —T I f (7 + !); 


21 + 1 31 + 1 


1 


21 


(16) 


see equations (2) and (3), with v =j + 1 and z = 1. Substituting (16) into (15), we obtain 


fv(eh = 


r(v + i) 

r 

r(v + i) 


n 2 1 7n 4 1 

1 + v(v - 1)-^ + v(v - l)(v - 2)(v - 3)- j + • 

6 f 2 360 f 4 


(17) 
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which is the desired asymptotic formula — commonly known as Sommerfeld’s lemma (see Sommer- 
feld, 1928). 4 

By the same procedure, one can derive the following asymptotic result, which is clearly a 
generalization of (17): 


/ 


<f>(x)dx 
e*-? + 1 


o 


J (p(x)dx+ 

o 


/ d<j>\ 7 jt 4 

\ d x )x=$ 360 


/ d 3 (f> 
\ dx 3 


31tt 6 / d 5 <p\ 

+ 15120 1 dx 3 + ' 

x=$ x ' X ~S 


(18) 


where <f> (x) is any well-behaved function of x. It may be noted that the numerical coefficients in this 
expansion approach the limiting value 2. 

Blakemore (1962) has tabulated numerical values of the function / v (e^) in the range — 4<£< 
+ 10; his tables cover all integral orders from 0 to +5 and all half-odd integral orders from —I 

to+f- 


F A rigorous analysis of the ideal Bose gas and 
the onset of Bose-Einstein condensation 

In this appendix we study the problem of the ideal Bose gas without arbitrarily extracting the conden¬ 
sate term (s = 0) from the original sum for N in equation (7.2.1) and approximating the remainder 
by an integral ranging from e = 0 to e = oo. We will instead evaluate the original sum as it is with 
the help of certain mathematical identities dating back to Poisson and Jacobi in the early nine¬ 
teenth century. Luckily, these identities obviate the necessity of approximating sums by integrals 
and yield results valid for arbitrary values of N (though, for all practical purposes, we may assume 
that » 1). For pertinent details of this procedure, see Pathria (1983) and the references quoted 
therein. 

We consider an ideal Bose gas consisting of particles of mass m confined to the cubic geometry 
Lx Lx L and subject to periodic boundary conditions so that the single-particle energy eigenvalues 
are given by 


h 2 

2 mL 2 


(«? + ”1 + 4 ) ’ 


(rii = 0 ,± 1 ,± 2 ,...). 


( 1 ) 


4 A more careful analysis carried out by Rhodes (1950), and followed by Dingle (1956), shows that the passage from 
equation (13) to (14) omits a term which, for large + is of order e~^. This term turns out to be cos((v - 1 )tt} F„(e~^) = 
cos((v - l)n}F v (l/z). For large z, this would be very nearly equal to cos{(v — l)jr)/z and hence negligible in comparison 
with any of the terms appearing in (17). Of course, for v = |, I,..., which are the values occurring in most of the 

important applications of Fermi-Dirac statistics, the missing term is identically zero. 

For v = 2, the inclusion of the missing term leads to the identity 

/2(^)+/ 2 (e- f )= z? 2 +t^, 
which is relevant to the contents of Section 8.3.B. 
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The sum (7.1.2) then takes the form 


jv = E (^ £ - 1 ) _1 = EE ^ 


-Ka+M 


= E e ~ Ja 

1= i 


6 l=i 


y e~i wn i y e~i wn 2 V' e ~ iwn l 

n x n 2 n 3 


( 2 ) 


where a = —fi/kT, ft = 1/fcT, and 


/3h 2 /k N 

W = -= = 7 T — , , 

2mL 2 \L 


(3) 


A (= hj■J'ljankT) being the mean thermal wavelength of the particles. 

To evaluate the sums in (2), we make use of the Poisson summation formula (see Schwartz, 1966): 


OO OO 

E /(”) = E F(q)\ F(q) = / f{x)e 2nic,x dx. 


(4) 


n =—oo cj =—oo 


The function F(q ) is, of course, the Fourier transform of the original function f(n). Choosing/(«) = 

. 2 

e -J wn } we obtain the remarkable identity 


OO 

y g -jwn 2 
n=—o o 



(5) 


It is instructive to note that the q = 0 term in (5) is precisely the result one would obtain if the 
summation over n were replaced by integration, as is customarily done in the treatment of this 
problem. Terms with q ^ 0, therefore, represent corrections that arise from the discreteness of the 
single-particle states. Using equation (5) for each of the three summations in (2), we obtain 


*v=E^n 

i=i *=i 


7 T v—\ 

717) E e 


-n^qf/jw 


© W EE 


n y=i 


e -l" 

W2 


exp 


+d2 + ^) 


7T 

jw 


L 3 


= 7E 


1=1 


v 

f 


e ]a -1“ In 2 / 2 2 2 \ 

i 3 7 T + E 73 7 E eX P(- /w K+'?2 + %) 


2 

J w 


( 6 ) 


where the primed summation in the second set of terms implies that the term with q = 0 has been 
taken out of this sum. 

The q = 0 term in (6) is precisely the bulk result for the total number of particles in the excited 
states of the system, namely Vg 3 / 2 (e~ a )/X 3 . In the second set of terms, the summation over j may be 
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carried out with the help of a straightforward generalization of identity (4), namely 

b ^ ~v 

(n) = - f(a) + -f{b) + F aib ily, F a ,bO ) = 

l=—oo 

where a and b are integers such that b > a. Applying (7) to the primed sum in (6), we obtain 


E/ 


J f(x)e ZwUx dx, (7) 

a 


oo 


oo 


j-3/ 2 g-J« e~ Y ( q) /j = 
7=1 


y^j-3/2 e -> g- y (<?)// 
7=0 


I - OO 

= E exp[-2y^)(o ! + 27ri7) 1/2 ], 

/= —OO 


( 8 ) 


where 




n 2 q 2 


nL 2 q 2 

A 2 


> 0. 


We readily note that, whatever the value of a, terms with Z ^ 0 are at most of order exp(—L/A) which, 
for L » A, are altogether negligible. As a consequence, no errors of order (A//.)” are committed if we 
retain only the term with l = 0. We thus obtain 


where 


[ g3/2 ( e ")+ 7rl/2 “ 1/2s (y)]> 


v/ e r 

SW = E w<?) = *w=3V‘ 


- ja/ 'fi + 'fl 


(9) 


( 10 ) 


while y is given by 


y = jr 1 / 2 ^ 1 / 2 —. (11) 

y A 

In view of equation (13.6.36), the parameter y is a measure of the lateral dimension L of the system 
in terms of its correlation length f — to be precise, y = L/2p We, therefore, expect that, as we lower 
the temperature of the system and enter the region of phase transition, this parameter will go from 
very large values to very small values over an infinitesimally small range of temperatures around the 
transition point. We’ll examine this aspect of the problem a little later. 

At this point, it is worthwhile to note that if the summation over q that appears in equation (10) 
is replaced by integration, which is justifiable only in the limit y -> 0, we obtain from equation (9) 


L 3 

A 3 


g3/2 (e “)+jr 1/2 or 1/2 



L 3 

A 3 


g3/z(e “) + 


1 


N 


a 


( 12 ) 
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in perfect agreement with the bulk result obtained in Section 7.1, with a <£ 1. To have an idea of the 
“degree of error” committed in making this replacement, we must evaluate this sum more accurately. 
For this, we make use of another mathematical identity, first established by Chaba and Pathria (1975), 
namely 


E' 


_E_+ *e- 2 w 

q 2 (y 2 + n 2 q 2 ) q 


— 2tc y-\ — w; + C 3 , 

y 2 


(13) 


where 


/ 


C 3 = n lim 
y^o 


E 

<? 


P -2yq 


-/ 

all q 


P -2yq 


-dq ~ -8.9136. 


(14) 


It is important to note that the constant C 3 is directly related to the Madelung constant of a simple 
cubic lattice; see, for instance, Harris and Monkhorst (1970). Now, since the second part of the sum 
appearing on the left side of (13) is directly proportional to S(y), we can rewrite (9) in the form 


N^ 


L 3 

X 3 


§3/2 (e “) + 


L 2 71 L 2 
J 2 y 2 + X 2 




q 2 (y 2 + n 2 q 2 ) ) 


(15) 


We observe that the second term on the right side of (15) is equal to 1/a — which is precisely Nq 
when a <•£ 1. The condensate, therefore, emerges naturally in our analysis and does not have to be 
extracted prematurely, as is done in the customary treatment. 

Now, in view of the fact that, for small a, 


§3/2(^)-f(^)-2 7 r 1 / 2 a l/2 ) 

equation (15) is further simplified to 


K 4 ?( 3 /2) + Ko+ g(|-^' 


q 2 (y 2 +n 2 q 2 ) 


Introducing the bulk critical temperature T c (oo), as defined in equation (7.1.24), 


7 /( 00 ) = 


h 2 


N 


12/3 


2nmk 

we obtain from (17) the desired result, namely 

3/2 


N 


-I 2/3 


=Te t(DJ ' 


,.(2if + l - -* + ?r 

\ 7 c (oc) / X 2 1 n n 


q 2 (y 2 +n 2 q 2 ) 


(16) 


(17) 


(18) 


N 0 = N 


(19) 
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The first part of this expression is the standard bulk result for No, while the second part represents 
the “finite-size correction” to this quantity. More important, while the main term here is of order 
N, the correction term is of order IV 2 / 3 . In the thermodynamic limit, the correction term loses its 
importance altogether and we are left with the conventional result following from the customary 
treatment. 

Finally, we study the variation of the scaling parameter y as a function of T ; this will also enable 
us to examine the manner in which the correlation length £ and the condensate fraction/ (— Nq/N) 
build up as we move from temperatures above T c (oo) to those below T c (oo). For this, we introduce 
a scaled temperature, defined by t = [T — T c (oc)\/T c (oc), and study the problem in three distinct 
regimes: 

(a) For t > 0, such that l»t» AT -1 / 3 , we make use of the result for a, as stated in Problem 7.3. 

Combining this result with equation (11), we get 








/« 



(20a) 

(20b) 

(20c) 


where i (= L/1V 1 / 3 ) is the mean interatomic distance in the system. 

(b) For \t\ = Of/V” 1 / 3 ), the parameter y = 0(1) and its value has to be determined numerically. At 
t = 0, this value is determined by the equation S(yo) = 2; see equations (9) and (16). We thus 
get: yo — 0.973. The correlation length £ in this regime is 0(L ) and the condensate fraction is 
OtAr 1 / 3 ). 

(c) For t < 0, such that 1 3> [f[ 3> AT -1 / 3 , we get 


y 




-1/3 

A/ 1 / 6 |t|— 1/2 , 


$ 



7 3 \ ~| 1 / 3 L 3 / 2 

U/J f /2 


m 1/2 , 





(21a) 

(21b) 

(21c) 


We thus see how, over an infinitesimally small range of temperatures OfAf” 1 / 3 ) around l = 0, 
the parameter y descends from values G( N 1 ,/3 ) to values OfAf” 1 / 6 ) while the correlation length £ 
grows from values 0(1) to values OfL 3 / 2 /^ 1 / 2 ), and the condensate fraction / grows from values 
0(AI _1 ) to values 0(1). As N oc, the transition region collapses onto a singular point f = 0 and 
the phenomenon of Bose-Einstein condensation becomes a critical one. 
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G On Watson functions 

In this appendix we examine the asymptotic behavior of the functions 

OO 

W d (4>) = f e-* x [e~ x h (x)] d dx (1) 

o 

for 0 < 0 <§; 1. First of all, we note that if we set <j> = 0 the resulting integral converges only if d > 2. To 
see this, we observe that, with 0 = 0, convergence problems may arise in the limit of large x where 
the integrand 

[e~ x I 0 (x)] d « (2nx)- d/2 (x»l). (2) 

Clearly, the integral will converge if d > 2; otherwise, it will diverge. We, therefore, conclude that 

OO 

w d ( 0 ) = J [e~ x Io(x)] d dx (3) 

o 


exists for d > 2. 

Next we look at the derivative 

OO 

KW) = -f «r**[<r*z 0 (x)] d xdx. (4) 

0 

By the same argument as above, we conclude that 


W^(0) = — J [ e x Io(x)] d xdx 
o 

exists for d> 4. The manner in which W.(ct>) diverges for d < 4, as 0 -> 0, can be seen as follows: 

OO 

Wrf(0) = - J e~y [e“l ,/ ^7o(y/0)] ~^ydy 

0 

oo 

"~ (2 f ‘~ V2 ~ d ’' 2dy W<<1> 


(5) 


r{(4-d)/2) 

(2^) rf / 2 0(4” d )/ 2 ' 
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Integrating (6) with respect to <p, and remembering the comments made earlier about W d ( 0), we 
obtain the desired results: 



(2^) _rf/2 r{(2 - d)/2}4>~ (2 ~ d)/2 + const. 

for 

d <2 

(7a) 

W d (<P) » 

(2n)~ l lx\(l/(p) + const. 

for 

d = 2 

(7b) 


. W d { 0) - (2n)~ d/2 \T{{2 - d)/2)\cp (d ~ 2)/2 

for 

2 < d < 4. 

(7c) 


For d > 4, we have a simpler result: 

W d (4>)^W d (0)-\W' d (0M, (8) 

for, in this case, both W d (0) and W.(0) exist. 

The borderline case d = 4 presents some problems that can be simplified by splitting the integral 
in (4) into two parts: 

OO 1 OO 

/-/+/■ 

o o 1 


The first part is clearly finite; the divergence of the function W' A {<p), as <p ->• 0, arises from the second 
part which, for 4> 1, can be written as 


cxj 

: f e~^ x {2n x)~ 2 xdx = — ^E\(<f>), 
J An 2 


( 10 ) 


where £i( cp ) is the exponential integral; see Abramowitz and Stegun (1964), Chapter 5. Since E\ Up) 

— In <p for (p < 3 C 1, we conclude that 

ln ^ (11) 

Integrating (11) with respect to <p, we obtain 

W4(0)^W 4 (O)-^2 0ln(l/0). (12) 

Equations (7), (8), and (12) constitute the main results of this appendix. 

For the record, we quote a couple of numbers: 


W 3 (0) = 0.50546, W 4 (0) = 0.30987. 


(13) 


H Thermodynamic relationships 

The following four equations relating partial derivatives are sometimes known as the Four Famous 
Formulae. They make it easy to derive thermodynamic relations in any one assembly or to convert 
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relations from one assembly to another; by assembly we mean the set of thermodynamic parameters 
on which a system depends. If the quantities x, y, z are mutually related, then 



Entropy S(N, V, U ) and the microcanonical ensemble 

The entropy describes a closed, isochoric, adiabatic assembly, so it is a function of internal energy, 
volume, and number of molecules: 


, 1 P , a , 

dS = —dU + -dV- -dN, 
T T T 



fl / dS \ 

T = ~\dNj U}V 


The Maxwell relations for the entropy are 

7 9(1 m \ = / d(P/T) \ 

v 9y / u, N v 91/ ) VN 

/ 9(1/T) \ ( d(n/T) \ 

V dN )u,v \ dU ) Vi n’ 

f d(P/D \ f dUi/T) \ 

V dN ) u v V aV ) U,N 


(2a) 

(2b) 

(2c) 

(2d) 

(3a) 

(3b) 

(3c) 


The entropy is determined from the number of microstates in the microcanonical ensemble 


r(N,V,U;AU) = Ti(A AU (M -U)) t (4) 

where M is the Hamiltonian of the system and A a u(x) is the step function that is unity in the 
range 0 to AU and zero otherwise. The quantity T (N, V, U; All ) here denotes the number of discrete 
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quantum states in the energy range between U and U + A U. The entropy is then given by 

S(N, V, U) = k In T (AT, V, U;AU). (5) 

The bulk value of the entropy does not depend on the value chosen for A U. 


Helmholtz free energy A(N, V,T) = U -TS 
and the canonical ensemble 

The Helmholtz free energy describes a closed, isochoric, isothermal assembly, so it is a function of 
temperature, volume, and number of molecules: 


dA = -SdT - PdV + fidN, 


S=- 1^) 
\dT) 


V,N 


\dV J rtf 


n = 


3 A 

m 


T,V 


The Maxwell relations for the Helmholtz free energy are 


as 

av 

as 

m 


T,N 


T,V 


a p 

af 


V,N 


a p\ 

3 N ) jy 


3/x 

~af 

an 

av 


V,N 


T,N 


The Helmholtz free energy is determined from the canonical partition function 


Q n (V, T) = Tr(exp(—= J e~ pu \ -^ T; r(N, V, U;AU)\dU. 


I A U 


The Helmholtz free energy is given by 


(6a) 

(6b) 

(6c) 

(6d) 

(7a) 

(7b) 

(7c) 

( 8 ) 


A(N, V, T) = -kTIn Q n (V, T). 


( 9 ) 
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Thermodynamic potential n(/x, V, T ) = -A + pcN = PV 
and the grand canonical ensemble 


The thermodynamic potential describes an open, isochoric, isothermal assembly, so it is a function 
of temperature, volume, and chemical potential: 


d n = SdT + PdV + Nd/j ., 
S = 


an\ 

377 


'-(£) 


v,n 


T,n 


T,V 


(10a) 

(10b) 

(10c) 

(10d) 


The Maxwell relations for the thermodynamic potential are 



T.n 



3S\ 

9^ / t,v 




(11a) 

(lib) 

(11c) 


The thermodynamic potential is a function only of a single extensive quantity, V, so n (/; , V, T) = 
P(/r, T)V, that is, the pressure is the thermodynamic potential per unit volume. Therefore, we can 
write simpler thermodynamic relations in terms of pressure P, entropy density s = S/V, and number 
density n = n/V: 


dP = sdT + ndfi , 


s 


ti = 


3 P 

a t 

3 P\ 

dll) T 


dS \ 


3 li/i 


3 T 


(12a) 

(12b) 

(12c) 

(12d) 


The thermodynamic potential and pressure are determined from the grand canonical partition 
function 

OO 

&(/x, V, T) = Tr(exp(-^^ + P/iN)) = e^ N Q N (V, T), 

N =0 


( 13 ) 
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with the result 


P(P, T) 


U(n,V, T) 
V 


kT 

~V 


ln(2(/x, V, T). 


(14) 


Gibbs free energy G{N,P,T) =A + PV —U- TS + PV = /iN 
and the isobaric ensemble 


The Gibbs free energy describes a closed, isobaric, isothermal assembly, so it is a function of 
temperature, pressure, and number of molecules: 


dG = —SdT + VdP + [cdN, 




P,N 


V = 


n = 


9G\ 

~dPj 

9G\ 
9iV / 


T,N 


T,P 


The Maxwell relations for the Gibbs free energy are 

' 9S 


dP 

dS 

8N 

dv 

8N 


T,N 


T,P 


T,P 


977 

dTj 


P,N 


P,N 


9/x 

9P 


T,N 


(15a) 

(15b) 

(15c) 

(15d) 

(16a) 

(16b) 

(16c) 


The Gibbs free energy is a function only of a single extensive quantity, N, so GOV, P, T) = N/i (P, T), 
that is, the chemical potential is the Gibbs free energy per particle. Therefore, we can write sim¬ 
pler thermodynamic relations in terms of the pressure P, entropy per particle s = S/N, and specific 
volume v = V/N: 


dji — —sdT + vdP, 



(17a) 

(17b) 

(17c) 

(17d) 


The Gibbs free energy and the chemical potential are determined from the isobaric partition 
function 

OO 

J e~P pv Q n (V, T)dV] 

0 


Y n (P, T) = Tr(exp(-7,W - pPV )) = ^ 


(18) 
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the cube of the thermal deBroglie wavelength is employed here to make the partition function 
dimensionless and is irrelevant in the classical thermodynamic limit. This leads us to the result 

G(N, P, T) = Nfi(P, T) = -kT\nY N (P, T). (19) 

The isobaric ensemble is often used in computer simulations to avoid two-phase regions that are 
present at first-order phase transitions. 

Internal Energy U(N,V,S ) 

The internal energy describes a closed, isochoric, adiabatic assembly, so it is a function of entropy, 
volume, and number of molecules: 


dU = TdS — PdV + fidN, 



Maxwell relations for the internal energy are 



(20a) 

(20b) 

(20c) 

(20d) 

(21a) 

(21b) 

(21c) 


Enthalpy H(N,P,S) = U + PV 

The enthalpy describes a closed, isobaric, adiabatic assembly, so it is a function of entropy, pressure, 
and number of molecules. 


dH = TdS + VdP + ndN, 

' dH'' 


T = 


V = 


P = 


dS 

dH 

dP 

dH 

m 


P,N 


S,N 


S,P 


(22a) 

(22b) 

(22c) 

(22d) 
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The Maxwell relations for the enthalpy are 



(23a) 

(23b) 

(23c) 


This assembly is often used by chemists to describe chemical reactions that take place rapidly in the 
laboratory at fixed pressure where the speed of the reaction is too fast to allow a substantial heat 
exchange with the environment. 


Magnetic free energy F(T,H ) = U{S,M) - TS-HM 

The magnetic free energy describes a system with a fixed number of magnetic dipoles, at tempera¬ 
ture T and magnetic field H. The thermodynamic relations and Maxwell relations are 


dF = —SdT — MdH, 



/9S\ _/9M\ 
\dH) T ~ \~df) H 


(24a) 

(24b) 

(24c) 

(24d) 


This assembly is defined by the fixed-field canonical ensemble where the canonical partition 
function for spins {«,•) with dipole moment n and exchange energies /,y is 

Qn(T,H ) = Tr (exp(-pM)) = ^ exp j J2 K >] s i ' S J + h J2 Sz J )’ (25) 

{*) i / 


where the coupling constants are K/j = Jjj/kT and the field coupling is h = /iH/kT. The magnetic free 
energy is then given by 


F(T,H) = -kT\nQ N (T,H). 


(26) 


Convexity and variances 

The convexity of the entropy S that follows from the second law of thermodynamics requires that 
all second derivatives of free energies have a unique sign. These derivatives are all proportional 
to variances of different statistical quantities. A few important examples are the heat capacity, the 
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isothermal compressibility, and the magnetic susceptibility: 


C V = T 


dS\ 

) V,N 



icT 2 


> o, 


1 (dn\ 1 /3 2 p\ 

KT -rfW) T -rf[^ 2 ) T 


( i \ (N 2 )-(N) 2 
\nkT ) (N) “ ’ 


XT = 



(d 2 f\ (M 2 )-(M ) 2 

\dH 2 ) T kT ~ ■ 


(27) 


(28) 


(29) 


I Pseudorandom numbers 

The random numbers used in computer simulations are not truly random but rather 
pseudorandom. They are intended to display as many of the characteristics of a random 
sequence as possible but are generated using simple integer algorithms. “Anyone who 
considers arithmetical methods of producing random digits is, of course, in a state of sin. 
For, as has been pointed out several times, there is no such thing as a random number — 
there are only methods to produce random numbers, and a strict arithmetic procedure 
of course is not such a method” (von Neumann, 1951). von Neumann’s quip was not 
intended to dissuade the use of pseudorandom numbers but rather to encourage users 
to understand how such numbers are generated, their statistical properties and potential 
weaknesses. Pseudorandom number generators are essential in computer simulations in 
many fields but a generator should not blindly be trusted unless it has been thoroughly 
tested. There is an extensive literature on theoretical methods for creating and empirically 
testing pseudorandom number generators; see Knuth (1997), L’Ecuyer (1988, 1999), and 
Press et al. (2007). 

The most commonly used and tested classes of pseudorandom number generators are 
based on integer arithmetic modulo large, usually prime, numbers. The simplest class 
of generators is the prime modulus linear congruential method that generates pseudo¬ 
random numbers in the range [1,2,1], where m is a prime number. Each new 
number is based on the previous number according to the formula 

rj = a q_imod m, (1) 

where the multiplier a is a carefully chosen and empirically tested integer less than m. 
If a is chosen properly, the generator will produce a sequence of integers that includes 
every integer in the range [l,2 ,...,m— 1] exactly once before the sequence begins to repeat 
after m — 1 calls to the generator. Pseudorandom floating-point numbers in the open range 
(0,1) are produced by a floating-point multiplication by m~ l . This produces a uniformly 
distributed set of floating-point numbers on a comb of m — 1 values between zero and one. 
The floating-point numbers 0.0 and 1.0 will not appear in the sequence. Often, in imple¬ 
mentations, the modulus chosen is less than 2 31 to allow for the integers to be represented 
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by four-byte words. Generators of this form will repeat after about two billion calls. This 
may be fine for some applications but is often inadequate for a scientific code. Even a 
modest simulation can exhaust such a simple generator. However, linear congruential gen¬ 
erators with different moduli and multipliers can be combined to give much longer periods 
and less statistical correlation (L’Ecuyer, 1988). The following algorithm to generate a uni¬ 
formly distributed pseudorandom floating-point number r using two linear congruential 
generators gives a much longer sequence before repeating than does a single generator. 

ALGORITHM 

qj = a\ qj _i mod mi 
Sj = ci 2 sj -1 mod m 2 
z=(qj-sj ) 

if z < 0, then z = z + m\ — 1 
r — zm j" 1 

The length of the period of the combined generator is equal to the product of the non¬ 
common prime factors of mi - 1 and m 2 — 1. L’Ecuyer (1988) recommended the follow¬ 
ing multipiers and moduli: mi = 2147483563 = 2 31 — 85, a\ = 40014, m 2 = 2147483399 = 
2 31 — 249, and «2 = 40692. The two individual generators do well on the spectral test and 
other tests of correlations (Knuth, 1997) and (mi — l)/2 and (m 2 — 1)/2 are relatively prime, 
so the period of the generator is 2.3 x 10 18 . This can be implemented very easily in high- 
level languages using IEEE double precision floating-point arithmetic since a,m,' < 2 53 . 
L’Ecuyer (1988) and Press et al. (2007) also showed how the generators can be imple¬ 
mented using four-byte integer arithmetic. The exact order of the pseudorandom numbers 
can also be shuffled to improve statistics (Bays and Durham, 1976). 

There are several other classes of pseudorandom number algorithms and many good 
generators available that have been extensively tested in the computing literature and are 
in wide use in the scientific literature; for summaries, see Newman and Barkema (1999), 
Gentle (2003), and Landau and Binder (2009). However, note that subtle correlations in a 
pseudorandom sequence can produce very large deviations compared to exact results, so 
care is warranted; see Ferrenberg, Landau, and Wong (1992), Beale (1996), and Figure 1.1. 
Different classes of generators have different correlation properties, so substituting a gen¬ 
erator based on a different algorithm can sometimes be a useful strategy for testing a 
computer code. 


Gaussian distributed pseudorandom numbers 

One often needs pseudorandom numbers that are drawn from a Gaussian distribu¬ 
tion centered at the origin with unit variance: P(g ) = exp (-g 2 /2) /J'Itt. The following 
algorithm for generating Gaussian-distributed pseudorandom numbers from pairs of 
uniformly distributed pseudorandom numbers is based on an algorithm by Box and Muller 
(1958). 
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FIGURE 1.1 (a) The exact energy distribution for a 32 x 32 Ising lattice at the critical temperature (solid line) and the 
distribution calculated from 10 7 configurations using the Wolff Monte Carlo algorithm with the R250 feedback 
shift-register pseudorandom number generator (error bars). 5 The distribution calculated using the Wolff algorithm 
with Numerical Recipes ran2() (Press et al., 2007) is also shown, but is almost indistinguishable from the exact solid 
curve on this scale, (b) Deviation of the Monte Carlo results from the exact distribution in units of the statistical 
uncertainty of each point. The +'s indicate the ran2() results and the x's indicate the feedback shift-register 
results. The x 2 -value for the two cases yields x 2 = 190 for 210 nonzero points and x 2 = 2.8 x 10 4 for 217 nonzero 
points, giving deviations of -0.95a and 1300a, respectively; see Section 13.4.A and Beale (1996). 


ALGORITHM 

To generate pairs of Gaussian pseudorandom numbers gi and g 2 from uniformly dis¬ 
tributed pseudorandom numbers x and y. 

REPEAT 

x = 2rand() — 1.0 
y — 2rand() — 1.0 

s — x 2 +y 2 

UNTIL s < 1 



gt =xw 

g2 = yw 


5 An example of a widely used and trusted pseudorandom number generator whose subtle correlations have been 
shown to adversely affect the results of some MC simulations is the R250 feedback shift-register generator. R250 gener¬ 
ates a pseudorandom positive four-byte integer 0 < r,- < 2 31 by taking an exclusive or of two previous random integers 
kept in a 250 element table: r,- = r ; _ 10 3 a rj_ 2 50 - As with other generators, the floating-point numbers on [0,1] are pro¬ 
duced by dividing the pseudorandom integer by 2 31 . This particular feedback shift-register algorithm is very fast and 
produces fairly uncorrelated random pairs [r^k-i] = 1/4 for i 0. Triplets are also uncorrelated {ri c ri c _iri c _j) = 1/8, except 
for one particular triplet correlation, (Ctr,t-io 3 r /t- 25 o)»which can be shown to give (6/7)(l/8) rather than the correct value 
of 1/8; see Heuer, Dunweg, and Ferrenberg (1997). All higher order correlations that involve these same triplets are sim¬ 
ilarly affected. This triplet correlation being off by 14% can greatly affect Monte Carlo results (Ferrenberg, Landau, and 
Wong, 1992; Beale, 1996). 
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Adiabatic, 285, 289 

processes, 11,16, 23,190, 340 
Adiabats 

of blackbody radiation, 206 
of a Bose gas, 190 
of a Fermi gas, 190, 271 
Adsorption, 112 
Anharmonic oscillators, 87, 88 
Antiferromagnetism, 401, 413 
Anti-Stokes scattering, 625 
Argon, 106-108, 338 

Autocorrelation functions, 596, 610-611, 
613-615, 617-618, 633-634 

B 

Bardeen, Cooper, and Schrieffer (BCS), 108, 
392-394 

Barometric formula, 175 
Baryon number density, 279, 286 
Baryon-to-photon ratio, 279, 280, 281, 283, 
289, 291, 292, 294 
BCS/BEC crossover, 393, 394 
Beta decay, 286 
Beta function, 657 
Beta-equilibrium, 285, 289 
Bethe approximation to the Ising lattice, 

427-428, 431, 433-434, 437, 466-467, 
476, 479, 490, 498-499 
Beth-Uhlenbeck formalism, 320-325, 342 
Big Bang, 275-280, 287, 290, 295 
Binary alloys, 414, 420 
Binary collision method, 331, 357, 364, 385, 
397 

Binding energy of a Thomas-Fermi atom, 
264-269 

Blackbody radiation, 65, 130, 151, 200-205, 
277, 278, 279n4, 281, 290 
Bloch’s T 3/2 -law, 465 


Block-spin transformation, 540 
Bogoliubov transformation, 363, 366, 526 
Bohr-van Leeuwen theorem, 90, 245 
Boltzmann factor, 53, 59-60, 121, 138, 342, 
501-502, 625 
Boltzmannian systems 
classical, 9, 25, 54, 143-147, 150, 152, 
299-314 

quantum-mechanical, 331, 341 
Bose condensation, 108 
Bose gas, imperfect, 355-366 
Bose liquids, energy spectrum of, 366-369 
Bose-condensed fraction, 193,194, 197 
Bose-condensed peak, 196 
Bose-Einstein condensation (BEC), 108,181, 
183-184, 188-189, 191-199, 223, 
358-361 

distribution function, 195 
functions, 181, 664-667 
with interactions, 355-358, 367, 395-396 
statistics, 108, 132, 141-152, 165-167 
Bose-Einstein systems, 133-138, 141-152, 
179-229, 320-331 
multidimensional, 519-526 
nonuniform, 373-376 
Boundary conditions, 653-655 
Box-Muller algorithm, 651, 685 
Bragg peak, 339 

Bragg-Williams approximation to the Ising 

lattice, 420-427, 433, 434, 466-467, 494, 
497-499 

Brillouin function, 75-76 
Brillouin scattering, 625 
Brillouin zone, 625 
Broad histogram method, 507, 642n5 
Broken symmetry, 423 

Brownian motion, 583, 587-605, 607-608, 619 
of harmonic oscillator, 601-603 
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Brownian motion ( continued ) 
observation of, 587, 591, 599 
of a suspended mirror, 607-608 

C 

Canonical distribution, 49, 50, 88, 89 
Canonical ensemble, 29, 39-83, 121-124, 133, 
146, 334, 678, 679 

Canonical partition function, 678, 679, 682 
Canonical transformations, 37, 63 
Carbon dioxide, 172 
Carbon monoxide, 172 
Carbon tetrachloride, 626 
Carnahan-Starling equation of state, 314, 648 
Catalyst, 168, 171, 172 
Catalytic converter, 172 
Cepheid variable stars, 275 
Chandrasekhar limit, 264 
Chemical equilibrium, 170-172 
Chemical potential, 7, 51, 91-93, 109,110,158, 
159, 161, 170, 171, 679-681 
of a Bose gas, 357-358 
of a classical gas, 19-20 
of a Fermi gas, 237, 241-242, 252-253, 
269-273 

of a Fermi liquid, 389, 399 
of a gas of phonons, 207 
of a gas of photons, 207 
see also Fugacity 
Circulation, quantized, 370-376 
Classical limit, 22, 70, 127,136-137, 141, 181, 
186, 333, 366, 623 

Classical systems, 9-16, 25-29, 54-58 
in the canonical ensemble, 54-58,145 
in the grand canonical ensemble, 137,145 
interacting, 61-65, 299-314 
in the microcanonical ensemble, 145 
Clausius-Clapeyron equation, 109-111 
Cluster expansions 
classical, 299-307 
general, 315-319 
quantum-mechanical, 325-331 
Cluster functions, 328 


Cluster integrals, 303-304, 307, 315-316, 
324-325, 329-330 
irreducible, 308, 317-318 
Coexistence line, 106-107 
Coexistence pressure, 109, 110 
Combustion, 172 
Complexions, see Microstates 
Compressibility 

adiabatic, 224, 270, 390, 468, 521, 586 
equation of state, 336 

isothermal, 104, 107, 224, 270, 335, 343, 390, 
410, 435, 442, 452, 468, 472, 474, 
521-522, 586, 647, 683 
Computer simulation, 637-651 
Condensation, 392-394, 402-407 
in Fermi systems, 392-394 
scaling hypothesis for, 468 
of a van der Waals gas, 407-411 
Yang-Lee theory of, 494 
see also Bose-Einstein condensation 
Conductivity 
electrical, 247, 627 
thermal, 247, 627 

Configuration integral, 300, 301, 306, 308, 315, 
324, 326, 328 

Conformal transformation, 569 
Continuous transition, 107 
Convexity, 682-683 
Cooper pair, 392, 393 

Cooperative phenomena, see Phase transitions 
Coordination number, 416-417, 489, 564 
Correlation function 
of an re-vector model, 484-486 
of a Bose gas, 524-525, 534 
of a fluid, 332-335, 370, 647, 649 
of the Ising model, 481, 499-500, 533 
in the mean-field approximation, 453-455, 
461 

in the scaled form, 457, 576 
of the spherical model, 515-517, 535 
oftheYY model, 527 
Correlation length, 333, 334 
of an re-vector model, 485-487, 533 
of a Bose gas, 525, 534 
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of the Ising model, 481-482, 499-500, 531, 
532 

in fluids, 333-334 

in the mean-field approximation, 454-455, 
462 

of the spherical model, 516-517, 535 
of the XY model, 526-527 
Correlations, 107, 331-340 
spatial, 137-139, 332, 583 
spin-spin, 451, 462, 480, 516-518 
statistical, 151-152, 585, 594, 596 
time, 639 

Corresponding states, law of, 409 
Cosmic Background Explorer (COBE), 277, 278 
Cosmic microwave background (CMB), 
277-279, 293, 295 
Covariance matrix, 632 
Critical behavior 

of a Bose gas, 184-188, 522-524, 535-536 
of a fluid, 467 

of the Ising model, 429-433, 480-482, 
497-500, 533, 559, 560-563 
of a magnetic system, 443-444, 448-449, 
454-456, 467 

of the spherical model, 512-515, 535 
of avan derWaals gas, 408-411, 463 
of the XY model, 526-527 
Critical curve, 562-563, 566 
Critical dimension 
lower, 570, 578 
upper, 461, 528, 570, 578 
Critical exponents 
of an n-vector model, 486-487, 530 
of a Bose gas, 522-526 
experimental values of, 437 
hyperscaling relation among, 459, 570 
of the Ising model, 480, 482, 500, 530, 559, 
560 

mean-field values of, 455-456, 530, 557-558 
of polymers, 529 

scaling relations among, 457-459 
of the spherical model, 512-513, 515-518, 
535 

theoretical values of, 529 
thermodynamic inequalities for, 438-442 


Critical opalescence, 104, 450, 583 
Critical point, 106-108 
Critical surface, 553, 555-556, 562 
Critical temperature, 107,108 
Critical trajectories, 554 

Critical velocity of superflow, 222-223, 376, 378 

Crossover phenomena, 569 

Crystalline solid, 331,338-340 

Curie temperature, 422, 430 

Curie’s law, 73, 75, 89, 238-239, 246, 426 

Curie-Weiss law, 425-426 

D 

Dark ages, cosmic, 279n4, 282, 295 
Dark energy, 279, 281 
Dark matter, 279, 295 
Darwin-Fowler method, 44, 94 
deBroglie wavelength, 107 
Debye function, 209-210 
Debye temperature, 210-211, 249 
Debye’s theory of specific heats, 210-211 
Debye-Waller factor, 339 
Decimation transformation, 541, 552, 579-580 
Degeneracy criterion, 137,156, 179 
see also Classical limit 
Delta function, 69, 139 
Density fluctuations, 104-105, 342, 586-587 
in the critical region, 586 
Density function in phase space, 25-29 
Density matrix, 115-121, 123-124, 621-622 
of an electron, 122-123 
of a free particle, 123-125 
of a linear harmonic oscillator, 125-127 
of a system of free particles, 133-138 
of a system of interacting particles, 325-328 
Density of states, 53, 57-58, 192, 234, 258 
for a free particle, 33, 272, 653-655 
for a system of harmonic oscillators, 68-69 
Detailed balance, 118, 640, 642 
Deuterium, 290, 292 
Diamagnetism, Landau, 239, 243-247 
Diatomic molecules, 88, 111, 158-168 
Dietrici equation of state, 463 
Diffusion 

Brownian motion in terms of, 591-592 
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Diffusion ( continued ) 
coefficient, 591-592, 595, 618, 627 
equation, 592, 606 
Dimensionless parameters, 645 
Dipole-dipole interaction, 89 
Dirac delta function, 69,139 
Dispersion, 602 
Dissipation, 602, 603 

Dissipative phenomena, 583, 594, 617, 619 

Doppler broadening, 174 

Doppler red shift, 275 

Duality transformation, 492-494 

Dulong-Petit law, 207 

Dynamical structure factor, 624-626 

E 

e -expansion, 563-567 

Early universe, 275-295 

Edge effects, 654 

Effective mass, 222, 246, 248, 388 

Effusion, 155 

of electrons from metals, 247-257 
of photons from a radiation cavity, 204 
Einstein function, 207 
Einstein’s relations, 583 
Einstein-Smoluchowski theory of the 
Brownian motion, 587-593 
Electron gas in metals, 247-257 
Electron-positron annihilation, 287-289 
Electrons, 282-285, 287-289 
in a magnetic field, 122-123 
Elementary excitations 
in an imperfect Bose gas, 361-366, 395-396 
in a Bose liquid, 366-376, 396-397 
in a Fermi liquid, 385-392 
in liquid helium II, 215-223 
Energy 
cells, 141 

density, 276, 278, 279n4, 281, 283, 284 
distribution, 504, 506, 507 
fluctuations, 58-61,103-105 
hypersurface, 638, 644, 651 
Ensemble average, 26, 29-31, 48, 54, 116, 122 
Ensemble of Brownian particles, 592-593, 603, 
606-607 


Ensemble theory, 25-37 
Enthalpy, 8, 681-682 
Entropy, 677-678 

of an Ising ferromagnet, 425, 465, 466, 533 
density, 279n5, 283-285, 679 
of mixing, 16-20 

in nonequilibrium states, 584-586, 627-630 
statistical interpretation of, 5, 51-52,105, 
119 

Equation of state 
of a Bose gas, 181-182, 324-325 
of classical systems, 9, 15, 64-65, 307-314, 
317-319 

of a Fermi gas, 233, 324-325 
of a lattice gas, 464, 532-533 
of quantum-mechanical systems, 317-319, 
330 

reduced, 409 

see also van der Waals equation of state 
Equilibrium 

approach to, 3-6, 583, 603-608 
average, 638, 644 

constant, 171-172, 176-178, 401, 431 
deviations from, 584-587, 632-633 
persistence of, 583 
probability distribution, 640 
Equipartition theorem, 63-64, 67, 87, 160, 

162, 169, 247, 487, 537, 602, 606, 614, 
616, 646 
Ergodic, 644 
hypothesis, 31 

Evaporative cooling, 192,194 
Exchange interaction, 413-414 
Exclusion principle, 21, 132, 231, 236, 264, 350, 
352, 604 

Extensive variables, 1-2, 8, 16-17,19 

F 

Factorial function, 655, 658 
Fermi degeneracy, 259 
Fermi energy, 234, 242, 248-249, 252, 254, 
257-258, 260, 271, 273, 399 
Fermi gas, imperfect, 379-385, 399 
Fermi liquids, energy spectrum of, 385-392 
Fermi momentum, 234, 260, 265, 387 
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Fermi oscillators, 88 

Fermi temperature, 237, 248, 258, 260, 393 
Fermi velocity, 271 
Fermi-Dirac distribution, 247-248 
Fermi-Dirac functions, 231, 381, 387, 667-670 
Fermi-Dirac statistics, 108, 132, 138, 179, 212, 
247-248 

Fermi-Dirac systems, 133-138, 141-149, 
231-273, 320-331 
n-dimensional, 272 

Ferromagnetism, 401, 411-413, 465, 488 
see also Spontaneous magnetization 
Feshbach resonance, 359, 361, 393 
Feynman’s circulation theorem, 370 
Feynman’s theory of a Bose liquid, 367 
Fick’s law of diffusion, 591 
Field-theoretic approach, 345-355 
Finite-size effects, 501, 579 
Finite-size scaling, 570-579 
First-order transition, 107 
Fixed point, 313, 528, 553-557, 559-565, 
568-569 

Gaussian, 564-565 
non-Gaussian, 564-565 
Fluctuation-compressibility relation, 338 
Fluctuation-dissipation theorem, 338, 452, 
583, 603, 617-626, 630 
Fluctuations 

in an electrical resistor, 619-620 
in an (L,R)- circuit, 616-617 
in the canonical ensemble, 47-48, 58-61, 86 
critical, 104 

frequency spectrum of, 609-617 
in the grand canonical ensemble, 103-105 
in a multiphase system, 406 
of occupation numbers, 151-153 
thermodynamic, 584-587 
see also Brownian motion; Phase transitions 
Flux quantization in superconductors, 372 
Fokker-Planck equation, 603-608 
Four famous formulae, 676 
Fourier analysis of fluctuations, 609-617 
Fowler plot, 257 
Free volume, 85 


Fugacity, 96, 101, 122, 148, 150, 180, 185, 193, 
197-198, 198, 381, 520-521 
of a Bose gas, 184-185, 358, 395 
of a Fermi gas, 252, 272, 398 
of a gas of phonons, 207 
of a gas of photons, 207 
of a lattice gas, 417-418, 532 
of a two-phase system, 102 
see also Chemical potential 

G 

Gamma function, 655, 657 
Gaussian model, 508 
Gibbs free energy, 8, 20, 51, 96, 109-111, 
170-172, 680-681 
Gibbs paradox, 16-20 
Ginzburg criterion, 460 
Goldstone modes, 488 
Grand canonical ensemble, 91-110, 646n8, 
679-680 

Grand canonical partition function, 679 
Grand partition function, 96, 103 
of an ideal gas, 148-149, 180,231 
of an imperfect gas, 306, 315, 395, 399 
of a lattice gas, 417 
of a multiphase system, 403-407 
Gross-Pitaevskii equation, 358, 359 
Ground-state properties 
of a Bose gas, 355-361, 396 
of a Fermi gas, 233-236, 239-240, 242, 246, 
261, 269, 272-273, 384-385, 390-392, 
397-398 
H 

Hard disk fluid, 334 

Hard sphere fluid, 85, 314, 333, 335, 341, 364, 
471-473, 475, 647-650 
Harmonic approximation, 169, 205 
Harmonic oscillators, 33-35, 65-70, 98, 
100-102, 125-127, 192-194, 200, 
205-206, 358, 360, 601-603, 644 
Heat capacity, 197, 199 

Heisenberg model, 414, 427, 464-465, 488, 528 
Helicity, 283 
modulus, 469 
Helium, 108 
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Helium-4,108, 290-293; see also Liquid He 4 
Helium-3 ( 3 He), 293; see also Liquid He 3 
Helmholtz free energy, 8, 20, 50, 97, 171, 678 
Hexatic phase, 340 
High temperature series, 504 
H-theorem, 1 

Hubble expansion, 276, 280 
Hubble parameter, 276-278, 280 
Hubble relation, 276 
Hyperscaling relation, 459, 461, 482, 529, 
557-558, 570 

I 

Ice, 109 
Ideal gas 

of bosons, 180-191, 519-526, 536 
classical, 9-16, 32-35, 54-58, 60-61, 64, 

98-99 

of fermions, 231-247 
quantum-mechanical, 128-132, 141-149 
Importance sampling, 638, 640 
Indistinguishability 

of particles, 20-21, 98, 119, 129, 136, 141, 179 
of photons, 65 
Inelastic scattering, 624-626 
Inertial density of excitations, 212-215, 220, 
228 

Integrating factor for heat, 95 
Intensive variables, 17, 100, 403 
Internal energy, 196-199, 335, 681 
Internal molecular field, 422 
see also Ferromagnetism 
Internal motions of molecules, 155-170 
Ionization, 157 

Irrelevant variables, 554-558, 565-566 
dangerously, 565 

Irreversible phenomena, 589, 628-629 
Ising model, 414, 417, 419-435, 464 
in one dimension, 471, 476-482, 501, 

559- 560 

in three dimensions, 527-530 

in two dimensions, 457, 488-507, 534, 537, 

560- 563 

Isobaric ensemble, 471, 473-475, 646n9, 
680-681 


Isobaric partition function, 680 
Isothermal processes, 16 
see also Compressibility 
Isotherms 
of a Bose gas, 189 
of a multiphase system, 403-407 
of a van der Waals gas, 408 

J 

Joule-Thomson coefficient, 340-341 

K 

Kinetic coefficients, 627 
Kinetic pressure, 154 
Kirkwood approximation, 466 
Kosterlitz-Thouless transition, 340, 526-527 
Kramers’ q-potential, 95 
see also q-potential 
Kramers-Kronig relation, 623 
Kubo’s theorem, 620 

L 

Lagrange multipliers, method of, 43, 201, 359 
Lambda-transition in liquid He 4 , 108,188-189, 
222 

Landau diamagnetism, 239, 243-247 
Landau-Placzek ratio, 626 
Landau’s condition for superflow, 223 
see also Critical velocity of superflow 
Landau’s spectrum for elementary excitations 
in a Bose liquid, 366-369 
Landau’s theory of a Fermi liquid, 385-392 
Landau’s theory of phase transitions, 442-445 
Langevin function, 72, 245 
Langevin random force, 646n9 
Langevin’s theory of the Brownian motion, 
593-603 

as applied to a resistor, 619 
Langmuir equation, 112 
Laser cooling, 191-192, 258 
Last scattering, 276n3, 278, 282, 294, 295 
Latent heat 
of melting, 112 
of sublimation, 112 
of vaporization, 110 
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Lattice gas, 417-420, 464, 532-534 
Law of mass action, 171,431 
Leap-frog algorithm, 644 
Length transformation, 528, 539 
Lennard-lones interaction, 645 
Lennard-Jones potential, 309, 311-312 
Light scattering by a fluid, 450 
Light-year, 276, 277 
Linear congruential method, 683 
Linear response theory, 621-623 
Liouville’s theorem 
classical, 272-279 
quantum-mechanical, 117 
Liquid He 3 , specific heat of, 189 
Liquid He 4 

elementary excitations in, 366-369 
normal fraction in, 189, 216 
specific heat of, 189 
Liquid phase, 107, 108 
Liquid-crystal, 331 
Liquid-vapor coexistence, 106-108 
Long-range interactions, 519, 569, 643 
Long-range order, 331-332, 339, 340, 420 
in the Bethe approximation, 428-429 
in a binary alloy, 465-466 
in a Bose gas, 525 
in the Heisenberg model, 464-465 
in the spherical model, 518-519 
in a square lattice, 498-499 
in the Weiss ferromagnet, 422-425 
see also Spontaneous magnetization 
Lorenz number, 247, 250 
Low temperature series, 502, 505, 506, 507n2 

M 

Macrostates, 11 
Magnetic field, 682 
Magnetic Helmholtz free energy, 678 
Magnetic systems, thermodynamics of, 77-83 
see also Diamagnetism; Ferromagnetism; 

Paramagnetism 
Magnetic trap, 191-193, 258 
Marginal variables, 555 
Markovian processes, 604 


Mass motion, 212-214, 221-222, 228, 370 
nonuniform, 370-376 
Mass-radius relationship for white dwarfs, 
263-265 

Master equation, 603-604 
Maxwell distribution, 645 
Maxwell relation, 677-682 
Maxwell-Boltzmann statistics, 1, 5, 31, 143, 
145, 147, 150, 152, 173, 200, 248 
see also Classical limit; Classical systems 
Maxwell’s construction, 404, 408, 443, 445 
Mayer’s function, 300-301, 328, 343 
Mayer’s theory of cluster expansions, 315-319 
Mean field theories, 420, 422, 427-430, 453, 
456, 497, 500, 508 
limitations of, 460-463 
Mean values, method of, 44-49 
Megaparsec, 276, 277 
Meissner effect in superconductors, 372 
Memory functions, see Autocorrelation 
functions 

Mermin-Wagner theorem, 526 
Methane, 178 

Metropolis method, 641-643 
Microcanonical ensemble, 29-35, 39, 52, 

58-61, 103, 119-121, 141-145, 643-644, 
646, 677-678 

Microcanonical entropy, 506, 507 
Microstates, 2-5, 11-14, 33, 119, 129-130, 
141-145, 638-641, 643, 644 
of an Ising ferromagnet, 425, 464, 531 
of a binary alloy, 465-466 
“correct” enumeration of, 20-22 
Migdal-Kadanoff transformation, 580 
Mobility, 583, 595, 605, 618 
Molecular dynamics simulations, 643-646 
Monte Carlo simulations, 473, 506, 529, 
640-643, 650 

Monte Carlo Renormalization Group, 642n5, 
650 

Monte Carlo sweep, 639, 642, 643 
Most probable distribution, 42, 84,144 
Most probable values, method of, 42-44, 149 
Mulholland’s formula, 162 
Multicanonical Monte Carlo, 642n5 
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N 

Neel temperature, 465 
Negative temperatures, 77-83 
Nernst heat theorem, see Third law of 
thermodynamics 
Nernst relation, 595 
Neutrino, 279n4, 282-285, 289-290 
Neutron, 285-287, 292 
Neutron scattering, 107, 338 
Newton’s equations of motion, 638, 643 
Noise, 452 

in an (/., /h-circuil, 616-617 
white, 612, 615-617, 621 
Nonequilibrium properties, 617 
see also Fluctuation-dissipation theorem 
Normal modes 
of a liquid, 211 
of a solid, 206-210, 227-228 
re-particle density, 332 
Nucleosynthesis, 279n6, 287, 290-293, 294 
Number density, 278, 279, 283, 284, 286-288, 
291, 679 

Number of molecules, 677, 678, 680, 681 
Nyquist theorem, 616 

O 

Occupation numbers, 21-22, 149-152, 202, 
206-207, 213, 219, 221, 234, 381 
One-body density, 332 
One-dimensional fluid, 471-475 
Onsager relations, 626-632 
Order parameter, 435, 437, 442, 445, 452, 469, 
471, 497, 524, 526, 529, 576 
Ortho- and para-components, 166-167 

P 

Pade approximants, 528 
Pair correlation function, 332-338, 332-336, 
338, 343, 368, 472-473, 473, 475, 524, 
647-649 

see also Correlation function; Correlations 
Pair distribution function, 138-139 
Parallel tempering, 650 
Paramagnetism, 70-76, 239-243, 247, 272 


Particle simulations, 646-650 
Particle-number representation, 351 
Partition function 

of an interacting system, 300-304, 315, 320, 
325-331 

of an re-vector model, 483-484, 533 
of a Bose gas, 357 

of a classical ideal gas, 55-58, 85, 98, 147 
of a Fermi gas, 240-241 
of the Ising model, 477-478, 489-495, 
531-533, 543-546, 549-552 
of the lattice gas, 417 
of the spherical model, 508-510, 547-548, 
579 

of a system of free particles, 133-138, 
146-149 

of a system of harmonic oscillators, 65, 66, 
68, 82 

of a system of magnetic dipoles, 70, 77, 78, 
80, 81 

of a two-phase system, 402-405 
Pauli paramagnetism, 238-243, 272 
Pauli repulsion, 645 

Pauli’s exclusion principle, see Exclusion 
principle 

Percus-Yevick approximation, 333 
Perturbation theory, 380 
Phase diagram, 105-108 
Phase equilibrium, 109-110 
Phase separation in binary mixtures, 401, 411 
Phase space, 25-29, 32-35, 38, 54, 136, 638, 
643, 644 

Phase transitions, 401-469, 500 
and correlations, 430-431, 449-456, 539, 583 
a dynamical model of, 411-417 
in finite-sized systems, 570-579 
first-order, 107, 110, 642, 650 
and fluctuations, 449-456, 460-463, 519 
interfacial, 569 

Landau’s theory of, 442-445, 467 
second-order, 107, 442, 578, 650 
see also Critical behavior 
Phonons 

in a Bose fluid, 366, 369 
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effective mass of, 228 
in liquid helium II, 214-216 
in mass motion, 212-214, 221 
Photoelectric emission from metals, 250, 
255-257 

Photons, 35-36, 200-204, 226-227 
Plasma, 178, 277, 281, 293, 295 
Poisson equation, 266 
Polyatomic molecules, 168-170 
Positron, 282-285, 287-289 
Postulate of equal a priori probabilities, 2, 29, 
119-120 

Postulate of random a priori phases, 120 
Potassium (K), 258, 259 
Power spectrum, 603, 623 
Power spectrum, of a stationary variable, 603, 
610, 612-614, 623-624, 634 
Predictability of a variable, 609, 611, 612 
Pressure, 283, 284, 679-681 
Probability density operator, 254-256 
see also Density matrix 
Probability distribution, 640 
for Brownian particles, 587-593, 604-607 
for thermodynamic fluctuations, 584-587, 
632-633 

Proton, 278, 285, 286, 290, 294 
Pseudopotential approach, 355, 357, 364, 366, 
385, 397 

Pseudorandom numbers, 640-641, 683-685 

Q 

£j>-potential, 95-96, 98 
of an ideal gas, 145 
of a two-phase system, 406-407 
see also Grand partition function 
Quantized circulation in a Bose fluid, 370-376, 
396-397 

Quantized fields, method of, 345-400 
Quantized flux, 372 

Quantum statistics, 21, 97, 107, 115-140, 150, 
238 

see also Bose-Einstein systems; Fermi-Dirac 
systems 

Quark-gluon plasma, 282n7 


Quasichemical approximation to the Ising 
lattice, 427, 431,531 

see also Bethe approximation to the Ising 
lattice 

Quasielastic scattering, 336 
Quasi-long-range order, 331, 340 
Quasiparticles, see Elementary excitations 

R 

Raman scattering, 625 
Random close-packed, 650 
Random mixing approximation, 426, 431, 432, 
464 

see also Mean field theories 
Random walk problem, 588, 592 
Randomness of phases, 120-121, 610 
Ratio method, 528 
Rayleigh scattering, 625 
Rayleigh-Jeans law, 202-203 
Recombination, 279n4, 282, 293-295 
Relativistic gas, 38, 86, 175 
of bosons, 526, 535-536 
of electrons, 260-262, 273 
of fermions, 236 

Relativistic Heavy Ion Collider (RHIC), 282n7, 
296 

Relaxation time, 594, 599, 604-605, 607, 609, 
615, 619, 634 

Relevant variables, 555, 562, 565 
Renormalization group approach, 528-529, 
539-569 

general formulation of, 552-558 
to the Ising model, 543-546, 549-552, 
559-562 

to the spherical model, 547-549, 560 
Renormalization group operator, 552, 556 
linearized, 554, 580 
Response function, 601-603, 623 
Response time, 615-616 
Richardson effect, 251-255 
Riemann zeta function, 665-666 
Rigid rotator, 37,161 
Rotational Brownian motion, 599 
Rotons 

in a Bose fluid, 367-369 
in liquid helium II, 218-223 
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Rubidium (Rb), 191, 197, 199 
Rydberg, 293 

S 

Sackur-Tetrode equation, 19 
Saddle-point method, 44, 46, 83, 509 
Saha equation, 178, 291, 294 
Scalar models, 456, 461, 482, 487, 499, 540, 578 
Scaling fields, 555, 561 
Scaling hypothesis, 446-449, 457, 468 
Scaling relations, 457-458, 461, 468, 500, 529, 
543, 558 

Scaling theory, 540-543, 555-557 
Scattering, 107, 331-340 
of electromagnetic waves by a fluid, 
449-450, 455-456 
form factor, 336 
length, 358, 359, 393, 394 
Schottky effect, 79, 254-255 
Second quantization, 345-355 
see also Quantized fields 
Second-order transition, 107 
Shape scattering, 337 

Short-range order, 331-332, 334, 427, 432, 466 
Singularities in the thermodynamic functions, 
224, 225, 357-358, 523-525 
see also Critical behavior; Specific-heat 
singularity 

Smoluchowski equation, 589 
Sodium (Na), 191 
Solid phase, 106-108 
Solid-liquid coexistence, 106,108 
Solid-vapor coexistence, 101,106 
Sommerfeld’s lemma, 236, 256, 670 
Sommerfeld’s theory of metals, 248 
Soundwaves, 205-212 
inertial density of, 212-215 
see also Phonons 
Specific heat, 198, 199 
of an imperfect gas, 395-397 
of an Ising lattice, 432-433, 478-479, 
495-497 

of an n-vector model, 486-487 
of blackbody radiation, 205 


of a Bose gas, 182, 185-187, 190, 224, 
521-524, 535 
of a classical gas, 15 
of diatomic gases, 158-168 
of a Fermi gas, 233, 237-238, 248-249, 
270-272 

of liquid helium II, 188, 222 
of magnetic materials, 77, 79, 83, 89 
of polyatomic gases, 169 
of solids, 207-212, 227, 249 
of the spherical model, 511-518 
of a system of harmonic oscillators, 68, 
87-88, 207-212, 227 
of the XL model, 526 
Specific volume, 680 

Specific-heat singularity, 447, 462, 467, 488, 
495-497 

Spectral analysis of fluctuations, 609-617 
Spherical constraint, 508, 509, 511-512, 534, 
547, 548, 579 
Spherical field, 511 
reduced, 512 

Spherical model, 487, 508-519, 533-535, 
547-549, 579 

Spin and statistics, 132, 165-168, 341, 353 
Spin degeneracy, 283 

Spontaneous magnetization, 412,415, 422, 
424, 429-430, 439-440, 443, 499 
in the Ising model, 428-429, 497-498, 533 
in the spherical model, 511-518, 534 
see also Long-range order 
Standard candle, 275, 276 
Standard pressure, 171 
Standard states, 171 
Static structure factor, 336-338, 340 
Stationary ensembles, 27, 596, 619 
Stationary variables, Fourier analysis of, 
609-617, 633-634 
Statistical potential, 138, 324 
Steepest descent, method of, 44, 509-510 
Stefan-Boltzmann law, 203 
Stirling’s formula, 658-659 
Stochastic rate equation, 640 
Stoichiometric coefficients, 170 
Stoichiometric point, 172 
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Stokes law, 593 
Stokes scattering, 625 
Structure factor, 107, 337, 339, 475 
of a liquid, 368-370 
Sublimation, 106 
Superconductor, 392 
Supercritical phase, 107 
Superfluid, 108 

density near critical point, 468 
in mass motion, 222, 370-379 
transition in liquid He 4 , 188, 222, 469 
Superfluidity 

breakdown of, 222, 376-379 
Landau’s criterion for, 222-223, 378-379 
see also Critical velocity of superflow 
Supernova, 264, 275, 295 
Supersolid, 108 
Surface effects, 653-655 
Surface tension near critical point, 469, 500 
Susceptibility, magnetic 
of an n-vector model, 533 
of a Fermi gas, 239, 242, 245-246, 272, 399 
of the Ising model, 425, 443, 467, 478, 533 
of the spherical model, 511-518, 534 
see also Singularities 
Sutherland potential, 340 
Swendsen-Wang algorithm, 641n3 
Symmetry properties of wavefunctions, 132, 
136, 346-348, 627, 630 
Symplectic, 644 

T 

Takahashi method, 473 
Temperature, 672-674, 678-680 
Thermal deBroglie wavelength (equivalent to 
mean thermal wavelength), 107,195, 
286, 294 

Thermal expansion, coefficient of, 228 
Thermal wavelength, mean, 125,135-137, 139, 
147, 179, 227, 231, 245, 300, 327, 354, 
381, 520 

Thermionic emission from metals, 251-255 
Thermodynamic limit, 1,40, 64, 70, 100, 111, 
193, 337, 403, 472-473, 475, 486-487, 
570 


Thermodynamic potential, 679-680 
Thermodynamic pressure, 7,145,152, 403 
Thermodynamic temperature, 5 
Thermodynamics of the early universe, 
275-295 

Thermostat, 644, 646 

Third law of thermodynamics, 5, 52,119, 238, 
488 

Thomas-Fermi equation, 359 
Thomas-Fermi model of the atom, 264-269 
Thomson scattering, 281 
Tie line, 106-107 
Time-of-flight, 258, 360, 394 
experiment, 194 

T 3 -law of specific heats, 210-211 
Tonks gas, 472 
Transfer matrix, 501 

method, 476, 482, 488, 501, 531-532, 579 
Transition rate, 640, 641 
Transport phenomena, 604 
Trial state, 641-642 
Tricritical point, 468 
Triple point, 106,107 
Two-body density, 332 

Two-fluid model of superfluidity, 188, 215, 370 

U 

Ultracold atomic gases, 191-199, 258-259, 
358-361, 394 

Universality, 445, 500, 568 
classes, 449, 451, 457, 463, 471, 519, 526, 569, 
571 

Ursell functions, see Cluster functions 

V 

van der Waals attraction, 645 
van der Waals equation of state, 310-311, 340, 
404, 409, 426, 436, 446, 464 
Vapor phase, 106-108 
Vapor pressure of a solid, 102 
Variance, 682-683 

Vector models, 456, 482-488, 499, 519, 528 
Verlet algorithm, 644 
Virial, 335 

coefficients, 182, 307-317, 309-314, 319-325, 
320-325 
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Virial ( continued ) 

equation of state, 335, 647, 648 
expansion of the equation of state, 182, 233, 
307-309 

theorem, 63-64, 86, 273 
Viscosity, 593-595, 601, 614, 616 
Viscous drag, 593, 605 
Volume, 662-664 

Vortex motion in a Bose liquid, see Quantized 
circulation in a Bose fluid 

W 

Water, 109,110 
Water vapor, 172 
Watson functions, 513, 675-676 
Weeks-Chandler-Andersen (WCA) potential, 
652 

Weiss theory of ferromagnetism, 412, 420-427 
see also Mean field theories 
White dwarf stars, 259-264 
Wiedemann-Franz law, 247, 250 
Wiener-Khintchine theorem, 583, 609-617 
Wien’s distribution law, 202 


Wilkinson Microwave Anisotropy Probe 
(WMAP), 277, 278, 279n4, 283n9 

Wolff algorithm, 641n3, 684 

Work function, 252, 257 

X 

XY model, 414, 526 

Y 

Yang-Lee theory of condensation, 407 

Z 

Zero-point energy 

of a Bose system, 356, 365, 396 

of a Fermi system, 234, 269, 272, 384, 398 

of a solid, 205, 227 

Zero-point pressure 

of a Bose system, 356, 365, 396 

of a Fermi system, 261-262, 272, 384, 398 

Zero-point susceptibility of a Fermi system, 
239, 242, 246 

Zeros of the grand partition function, 407 

Zeroth law of thermodynamics, 5 

Zeta function, 665-666 



